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Abstract 

This  thesis  studies  the  problem  of  determining  achievable  rates  in  heterogeneous  wire¬ 
less  networks.  We  analyze  the  impact  of  location,  traffic,  and  service  heterogeneity. 
Consider  a  wireless  network  with  n  nodes  located  in  a  square  area  of  size  n  commu¬ 
nicating  with  each  other  over  Gaussian  fading  channels.  Location  heterogeneity  is 
modeled  by  allowing  the  nodes  in  the  wireless  network  to  be  deployed  in  an  arbitrary 
manner  on  the  square  area  instead  of  the  usual  random  uniform  node  placement.  For 
traffic  heterogeneity,  we  analyze  the  n  x  n  dimensional  unicast  capacity  region.  For 
service  heterogeneity,  we  consider  the  impact  of  multicasting  and  caching.  This  gives 
rise  to  the  n  x  2”  dimensional  multicast  capacity  region  and  the  2”  x  n  dimensional 
caching  capacity  region.  In  each  of  these  cases,  we  obtain  an  explicit  information- 
theoretic  characterization  of  the  scaling  of  achievable  rates  by  providing  a  converse 
and  a  matching  (in  the  scaling  sense)  communication  architecture. 
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Chapter  1 


Introduction 


Over  the  past  decades,  there  has  been  a  growing  disconnect  between  the  size  of 
communication  networks  that  are  built  and  planned  and  the  size  of  communication 
networks  that  are  fundamentally  understood.  On  the  one  hand,  wireline  networks 
(like  the  Internet)  have  grown  from  only  a  few  hundred  users  in  1981  to  over  one  billion 
in  2008,  and  wireless  networks  (like  metropolitan  mesh  networks,  sensor  networks,  or 
military  ad-hoc  networks)  with  up  to  a  million  communication  devices  are  being 
envisioned.  On  the  other  hand,  even  simple  communication  networks,  as  for  example 
a  four  node  wireless  network  with  two  sources  and  two  destinations,  or  an  even  simpler 
three  node  network  with  one  source,  one  destination,  and  one  relay,  are  only  partially 
understood.  Central  questions  remain  unanswered:  What  is  the  role  of  interference 
in  the  four  node  example  above,  and  what  is  the  role  of  cooperation  in  the  three  node 
example?  An  answer  to  these  questions  will  undoubtedly  have  profound  implications 
on  the  design  of  future  communication  networks. 

A  main  reason  for  this  disconnect  is  that  much  of  the  effort  analyzing  these  com¬ 
munication  systems  has  been  directed  at  obtaining  exact  solutions  for  small  networks 
and  trying  to  gain  insight  for  larger  networks  from  it.  This  has  proved  challenging, 
as  the  lack  of  a  complete  understanding  of  even  very  simple  networks  like  the  ones 
mentioned  above  illustrates.  Another  approach  is  to  directly  consider  large  networks, 
but  instead  settle  for  an  approximate  asymptotic  solution. 

To  analyze  such  large  networks,  a  model  of  how  they  are  generated  has  to  be 
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chosen.  More  precisely,  consider  a  wireless  network  with  n  nodes.  How  should  the 
location  of  these  n  nodes  be  chosen;  how  should  the  traffic  demand  they  generate 
behave;  and  how  should  the  services  they  require  be  modeled?  This  question  is  usually 
addressed  by  making  several  homogeneity  assumptions.  For  the  node  locations,  it  is 
usually  assumed  that  nodes  are  placed  uniformly  at  random  on  a  square  of  area  n; 
for  the  traffic  demands  that  each  node  is  source  for  exactly  one  destination  node 
chosen  uniformly  at  random  from  among  all  the  other  nodes,  and  that  all  these  n 
source-destination  pairs  communicate  at  equal  rate;  for  the  service  requirements  that 
all  nodes  generate  only  unicast  traffic. 

While  this  homogeneous  setting  is  convenient  mathematically,  it  does  not  provide 
a  very  accurate  model  of  reality.  In  fact,  for  the  node  locations  it  is  likely  that  some 
areas  are  denser  than  others  (e.g.,  towns  vs.  countryside);  for  the  traffic  demands 
that  users  communicate  to  nearby  nodes  more  often  than  to  faraway  ones,  and  that 
some  users  will  create  more  traffic  than  others  (e.g.,  sending  an  email  vs.  streaming 
a  movie);  for  the  service  requirements  that  some  information  needs  to  be  transmitted 
to  several  or  all  nodes.  In  other  words,  we  expect  node  location,  traffic  demands, 
and  service  requirements  to  be  highly  heterogeneous.  Moreover,  these  heterogeneities 
will  lead  to  different  asymptotic  behavior  of  the  network.  This  implies  that  the 
results  obtained  for  large  homogeneous  wireless  networks  will  only  yield  a  limited 
understanding  of  the  heterogeneous  networks  we  are  likely  to  encounter  in  practice. 

In  this  thesis,  we  develop  approximate  asymptotic  characterizations  of  the  perfor¬ 
mance  of  large  heterogeneous  wireless  networks.  We  consider  the  impact  of  location, 
traffic,  and  service  heterogeneity.  The  common  approach  to  deal  with  these  hetero¬ 
geneities  consists  of  first  finding  the  underlying  “coarse  structure”  of  the  network, 
capturing  the  essential  parts  of  the  heterogeneity.  Once  such  a  simple  coarse  struc¬ 
ture  is  identified,  rather  complicated  questions  about  the  network  can  be  elegantly 
analyzed  by  recasting  them  for  the  underlying  coarse  structure.  Moreover,  this  coarse 
structure  allows  to  obtain  insight  into  the  role  of  interference  or  cooperation  in  large 
networks  and  can  guide  the  design  of  communication  schemes  and  algorithms. 
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1.1  Network  Models 


As  mentioned  in  the  previous  section,  to  analyze  large  networks  a  model  for  their 
generation  has  to  be  chosen.  Here  we  briefly  review  some  popular  such  models  used 
throughout  the  literature. 

First,  a  model  for  the  node  location  needs  to  be  chosen.  A  standard  assumption 
is  that  the  n  nodes  of  the  wireless  network  are  located  on  a  square  of  area1  n.  It  is 
often  assumed  that  the  nodes  are  placed  uniformly  and  independently  at  random  on 
this  square,  which  we  refer  to  as  random  node  placement.  If  nodes  are  allowed  to  be 
placed  in  an  arbitrary  deterministic  manner  on  this  square,  we  speak  of  an  arbitrary 
node  placement.  In  the  case  of  arbitrary  node  placement,  it  is  usually  assumed  that 
there  is  some  constant  (independent  of  n)  minimum  separation  between  the  nodes. 
This  minimum-separation  requirement  prevents  degenerate  node  placements. 

Second,  a  model  for  communication  between  these  nodes  needs  to  be  selected. 
There  are  two  broad  categories  of  such  models.  Models  in  the  first  category  are 
motivated  by  current  wireless  technology  and  are  referred  to  as  protocol  models.  We 
describe  two  of  them  in  more  detail. 

Protocol  Model  1:  Node  v  can  receive  data  from  node  u  at  rate  1  bits/s  if  it  lies 
outside  the  region  of  interference  of  each  other  transmitter. 

Protocol  Model  2:  Node  v  can  receive  data  from  node  u  at  rate  log(l  +  SINR) 
bits/s,  where  SINR  is  the  signal  to  interference  plus  noise  ratio  at  the  receiving 
node  v.  Here  signals  are  attenuated  as  r_Q/2  over  distance  r  for  some  path-loss 
exponent  a  >  2. 

These  communication  models  share  two  assumptions.  First,  they  only  allow  point-to- 
point  communication,  and  second,  they  treat  all  interference  as  noise.  In  other  words, 
these  models  makes  assumptions  on  the  communication  protocol  used  between  these 

1This  is  referred  to  as  extended  node  placement.  When  nodes  are  located  on  a  square  of  area  1 
for  any  n,  we  speak  of  a  dense  node  placement.  Results  for  the  two  cases  are  closely  related,  and  we 
focus  on  the  extended  case  in  this  thesis. 
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nodes2.  These  two  assumptions  imply  that  the  only  allowed  communication  scheme 
in  the  wireless  network  is  multi-hop  routing,  in  which  a  message  travels  over  multiple 
hops  from  its  source  to  its  destination,  and  each  node  along  this  route  decodes  the 
message  received  from  the  previous  node  and  then  re-encodes  it  for  the  subsequent 
one. 

Models  in  the  second  category  do  not  make  any  assumptions  about  the  commu¬ 
nication  protocol  used,  but  rather  aim  at  directly  describing  the  underlying  wireless 
channel.  Two  popular  models  are  the  following. 

Gaussian  Model:  Signals  transmitted  at  node  u  are  received  at  node  v  at  distance 
r  attenuated  by  a  factor  r~a /2  for  some  path-loss  exponent  a  >  2,  and  then 
further  corrupted  by  additive  Gaussian  noise. 

Gaussian  Fading  Model:  Signals  transmitted  at  node  u  are  received  at  node  v  at 
distance  r  attenuated  by  a  factor  r~a^2hu^v  for  some  path-loss  exponent  a  >  2, 
and  then  further  corrupted  by  additive  Gaussian  noise.  Here  hUtV  models  small- 
scale  fading  between  the  nodes  u  and  v,  and  is  usually  assumed  to  vary  in  a 
stationary  ergodic  fashion  across  time. 

Third,  a  choice  of  service  requirements  has  to  be  made.  The  simplest  such  service 
requirement  is  unicast  traffic,  in  which  each  message  is  available  at  only  one  source 
node  and  requested  at  only  one  destination  node.  When  each  message  is  only  available 
at  one  source  node,  but  the  same  message  may  be  requested  by  several  destination 
nodes,  we  speak  of  multicast  traffic.  The  extreme  case,  in  which  each  message  needs  to 
be  transmitted  to  all  the  nodes  in  the  network,  is  termed  broadcast  traffic.  Instead  of 
varying  the  number  of  destinations  for  a  given  message,  we  can  also  vary  the  number 
of  sources  that  have  access  to  a  given  message.  We  think  of  these  sources  having 
access  to  the  same  message  as  caches  in  the  network,  replicating  these  messages.  If 
several  sources  have  access  to  the  same  message,  but  each  such  message  needs  to  be 
transmitted  to  only  one  destination  node,  we  speak  of  caching  traffic. 

2More  commonly,  only  the  first  model  is  called  protocol  model.  The  second  model  is  usually 
referred  to  as  generalized  physical  model.  We  use  the  name  protocol  model  for  both  of  them  to 
highlight  that  they  both  make  assumptions  on  the  communication  protocol  used  and  to  contrast 
them  with  the  more  information-theoretic  Gaussian  fading  channel  models  described  in  the  following. 


12 


Fourth,  a  model  for  traffic  generation  is  required.  The  standard  assumption  for 
unicast  traffic  is  that  each  node  is  source  for  exactly  one  other  node,  and  this  des¬ 
tination  node  is  chosen  uniformly  and  independently  at  random  from  among  all  the 
other  nodes.  Moreover,  all  these  n  source-destination  pairs  generate  traffic  at  equal 
rate.  We  refer  to  this  as  random  source- destination  pairing  with  uniform  rate.  The 
corresponding  maximal  achievable  per-node  rate  is  called  the  throughput  capacity  of 
the  wireless  network.  Another  figure  of  merit  for  unicast  traffic  that  is  often  used 
is  the  transport  capacity ,  which  is  the  maximum  achievable  rate-distance  product, 
summed  over  all  source-destination  pairs.  General  unicast  traffic  gives  rise  to  the 
unicast  capacity  region  Auc(n)  C  M+Xn,  which  characterizes  the  set  of  achievable 
rates  for  each  of  the  possible  n 2  source-destination  pairs.  For  multicast  traffic  the 
standard  homogeneity  assumption  is  that  each  node  in  the  network  is  a  source  and 
requires  to  multicast  at  uniform  rate  to  the  same  number  of  destination  nodes  chosen 
uniformly  at  random.  As  before,  general  multicast  traffic  gives  rise  to  the  multicast 
capacity  region  AMC(n)  C  M”x2  ,  which  characterizes  the  set  of  achievable  rates  for 
each  of  the  possible  n  x  2"  pairs  of  source  and  corresponding  group  of  destinations. 
Finally,  achievable  general  caching  traffic  can  be  described  by  the  caching  capacity 
region  A CA(n)  C  M^_nxn,  which  characterizes  the  set  of  achievable  rates  for  each  of  the 
possible  2n  x  n  pairs  of  caches  and  corresponding  destination. 


1.2  Prior  Work 

In  this  section,  we  review  prior  work  on  scaling  laws  for  wireless  networks.  Most 
of  the  literature  on  the  subject  focuses  on  the  homogeneous  setting,  i.e.,  random 
node  placement  and  unicast  traffic  under  random  source-destination  pairing  with 
uniform  rate.  The  literature  pertaining  to  this  homogeneous  setting  is  surveyed  in 
Section  1.2.1.  The  literature  for  arbitrary  node  placement,  in  which  no  probabilistic 
assumptions  are  made  on  the  node  location,  is  reviewed  in  Section  1.2.2.  Prior  work 
considering  more  general  unicast  traffic  patterns  is  discussed  in  Section  1.2.3.  Finally, 
Section  1.2.4  provides  a  literature  survey  for  work  on  service  heterogeneity,  such  as 
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multicast,  broadcast,  and  caching  traffic. 


1.2.1  Homogeneous  Setting 

The  scaling  approach  to  analyzing  wireless  networks  was  pioneered  by  Gupta  and 
Kumar  in  [15].  They  show  that  under  random  node  placement  and  assuming  protocol 
model  1,  the  throughput  capacity  scales  like3  ©((nlog(n))-1/2).  For  protocol  model 
2,  they  prove  an  upper  bound  of  0(n_1/a),  and  a  lower  bound  of  n((nlog(n))-1/2) 
(see  also  [14]).  Achievability  (i.e.,  the  lower  bound  on  the  throughput  capacity) 
is  shown  using  a  straight-line  multi-hop  routing  scheme.  For  protocol  model  1,  a 
simpler  proof  of  the  fl((n  log(n))_1//2)  lower  bound  on  the  throughput  capacity  was 
provided  subsequently  in  [30].  Achievability  is  shown  there  by  multi- hop  routing 
along  a  grid  structure  instead  of  straight  line  routing  proposed  in  [15].  Using  an 
argument  in  [3]  relating  protocol  models  1  and  2,  the  communication  scheme  proposed 
in  [30]  also  applies  to  protocol  model  2.  The  upper  bound  for  protocol  model  2  was 
later  sharpened  to  0(n-1/2)  in  [3].  This  leaves  only  a  order  log~1//2(n)  gap  between 
the  upper  and  lower  bounds  for  protocol  model  2.  This  gap  was  closed  in  [10], 
where  it  is  shown  that,  under  protocol  model  2,  the  throughput  capacity  scales  like 
©(n~1//2).  To  summarize,  under  protocol  model  1  the  throughput  capacity  scales  like 
©((nlog(n))-1/2),  and  under  protocol  model  2  the  throughput  capacity  scales  like 
©(n-1/2). 

The  results  mentioned  in  the  last  paragraph  show  that  under  a  protocol  model 
assumption,  the  maximum  achievable  per-node  rate  with  random  source-destination 
pairing  decays  to  zero  essentially  as  ©(n-1/2).  However,  this  result  was  derived  by 
restricting  communication  schemes  to  just  multi-hop  routing  (by  making  the  proto¬ 
col  model  assumption).  While  such  a  restriction  is  motivated  by  current  technology, 
it  is  not  clear  that  multi-hop  communication  is  optimal  for  large  wireless  networks. 
To  make  claims  about  the  performance  of  wireless  networks  under  any  communica¬ 
tion  scheme,  a  more  information-theoretic  approach  using  a  Gaussian  channel  model 
(either  with  or  without  fading)  is  necessary. 

3We  use  Knuth’s  asymptotic  notation.  See  Section  2.1  for  a  formal  definition. 
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Since  any  communication  scheme  for  the  protocol  models  is  also  a  communication 
scheme  for  the  Gaussian  channel  models  achieving  the  same  order  rate,  we  obtain  from 
the  results  mentioned  above  that  under  both  Gaussian  channel  models  (i.e.,  with  or 
without  fading)  throughput  capacity  is  lower  bounded  by  12  (n-1/2).  The  work  on 
scaling  laws  under  the  Gaussian  channel  models  can  be  grouped  into  two  streams. 
One  stream  of  work  [4,21,32,37,38,48,50,51]  focused  on  progressively  broadening 
the  conditions  on  the  channel  model,  under  which  multi-hop  communication  is  indeed 
order  optimal,  and  hence  throughput  capacity  is  also  upper  bounded  by  0(n_1,/2). 
Another  stream  of  work  [1,16,28,38,49]  focused  on  progressively  more  refined  multi¬ 
user  cooperative  schemes,  which  are  shown  to  significantly  out-perform  multi-hop 
communication  in  certain  environments,  hence  improving  the  f lfn-1//2)  lower  bound 
on  the  throughput  capacity. 

For  the  upper  bounds,  it  was  argued  in  [48]  that  for  the  Gaussian  channel  model 
with  path-loss  exponents  a  >  6  (i.e.,  signal  power  decays  quickly  as  a  function  of 
distance),  throughput  capacity  is  upper  bounded  by  0(n-1/2),  and  hence  multi-hop 
communication  is  order  optimal  in  this  regime.  This  result  was  later  extended  for 
the  Gaussian  fading  channel  model  in  [51].  A  sharper  bound  was  found  subsequently 
in  [21],  where  it  is  shown  that  under  both  Gaussian  channel  models  (with  or  without 
fading),  the  same  upper  bound  holds  for  a  >  5.  In  [32]  it  is  shown  that  under 
a  Gaussian  channel  model,  even  for  a  G  (2, 5]  the  throughput  capacity  is  upper 
bounded  by  log(n)) .  While  this  does  not  prove  the  order  optimality  of 

multi-hop  communication,  it  does  show  that  the  throughput  capacity  must  decay 
to  zero  as  n  —>  oo.  The  threshold  above  which  multi-hop  communication  is  order 
optimal  was  further  reduced  to  a  >  4.5  in  [4],  and  to  a  >  4  in  [50]  for  both  Gaussian 
channel  models.  For  the  Gaussian  fading  channel  model,  it  is  shown  in  [38]  that  the 
throughput  capacity  is  O^n^1/2"1"2)  for  a  >  3  and  0(n1~Q/2+£)  for  a  G  (2,3]  for  any 
£  >  0.  Hence  multi-hop  is  order  optimal  in  the  sense  of  achieving  the  best  scaling 
exponent  for  a  >  3.  For  the  Gaussian  channel  model  without  fading,  [37]  shows  that 
for  a  G  (2,4],  throughput  capacity  is  upper  bounded  by  0{v}^a+^~1^2  log3(n)).  All 
these  results  rely  on  the  cut-set  bound  to  upper  bound  the  sum  rate  across  a  cut 
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by  the  capacity  of  a  multiple-input  multiple-output  (MIMO)  point-to-point  channel 
in  which  all  the  nodes  on  one  side  of  the  cut  are  allowed  to  cooperate  in  sending 
a  message  and  all  nodes  on  the  other  side  of  the  cut  are  allowed  to  cooperate  in 
receiving  this  message.  They  differ,  however,  in  their  analysis  of  this  MIMO  channel. 

For  the  lower  bounds,  it  was  first  argued  in  [16]  that  there  exists  a  (carefully  con¬ 
structed)  node  placement  such  that  under  either  Gaussian  channel  model  (i.e.,  with 
or  without  fading)  higher  rates  than  suggested  by  the  results  for  the  protocol  models 
are  achievable.  This  node  placement  consists  of  two  clusters  each  containing  half  the 
nodes.  In  the  communication  scheme  proposed  in  [16],  the  nodes  in  the  first  cluster 
exchange  all  their  messages  among  themselves  and  then  jointly  encode  and  transmit 
them.  The  first  part  can  be  carried  out  efficiently  since  the  all  the  nodes  are  located 
close  to  each  other.  Similarly,  the  nodes  in  the  second  cluster  exchange  their  received 
observations  and  then  jointly  decode  them.  This  procedure  effectively  transforms  the 
network  into  a  distributed  MIMO  channel.  Similar  distributed  cooperative  schemes 
were  also  suggested  in  [28,49].  While  the  results  in  [16]  hold  only  for  a  particular 
node  placement,  it  is  shown  in  [1]  that  a  similar  approach  can  also  be  used  under 
random  node  placement.  However,  since  the  nodes  are  now  less  clustered,  setting  up 
the  distributed  MIMO  channel  incurs  a  loss.  In  [38]  it  is  shown  that  this  loss  can 
be  circumvented  by  using  the  scheme  proposed  in  [1]  multiple  times  in  a  hierarchical 
fashion.  More  precisely,  the  problem  of  setting  up  the  distributed  MIMO  channel  is 
recognized  as  being  essentially  the  same  as  the  original  communication  problem,  but 
at  a  smaller  scale.  Using  the  same  scheme  recursively,  we  can  thus  reduce  this  scale 
to  a  point  where  the  penalty  of  setting  up  the  initial  distributed  MIMO  channel  is 
negligible.  Analyzing  this  scheme  yields  that  for  the  Gaussian  fading  channel  model 
and  a  G  (2,3),  the  throughput  capacity  is  lower  bounded  by  for  any 

£  >  0.  This  matches  the  upper  bound  derived  in  the  same  paper  up  to  £  (see  the 
previous  paragraph),  and  hence  establishes  that  the  throughput  capacity  scales  like 
©(n1^"/2^)  in  this  regime. 

To  summarize,  under  the  Gaussian  channel  model  with  fading,  the  throughput 
scales  essentially  like  0  (n1-mm{3>a}/2±e)  for  any  a  >  2  (with  improvements  on  the  ±£ 
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Model 

Throughput  Capacity 

Protocol  model  1 

@((n  log(n))-1/2) 

Protocol  model  2 

@(n-1/2)  for  a  >  2 

Gaussian  with  fading 

@(n-1/2)  for  a  >  4 

G(n-1/2)  for  a  G  (2,4] 

O^n1/^4-8)-1/2  log3(n))  for  a  G  (2,4] 

Gaussian  without  fading 

@(n-1/2)  for  a  >  4 

n(n-1/2)  for  a  G  (3,4] 

0(n"1//2+£)  for  a  G  (3,4]  and  any  £  >  0 

0(n1_a/2±£)  for  a  G  (2,  3)  and  any  £  >  0 

Table  1.1:  Summary  of  scaling  results  for  the  throughput  capacity  in  random  wireless 
networks  under  various  communication /channel  models. 

possible  for  a  >  3).  Under  the  Gaussian  channel  model  without  fading  the  throughput 
capacity  scales  like  0  (n~1//2)  for  a  >  4,  and  it  is  lower  bounded  by  G  (n_1//2)  and  upper 
bounded  by  0(n1/(a+8)-1/2  log3(n))  for  a  G  (2,4).  The  results  reviewed  so  far  for  all 
communication  and  channel  models  are  listed  in  Table  1.1. 

1.2.2  Location  Heterogeneity 

Location  heterogeneity  is  usually  modeled  by  allowing  arbitrary  deterministic  node 
placement  with  a  minimum-separation  requirement.  For  protocol  models  1  and  2, 
such  arbitrary  node  placement  can  be  analyzed  by  converting  the  wireless  network 
into  an  equivalent  (wireline)  graph,  capturing  which  nodes  can  communicate  with 
each  other,  and  a  set  of  constraints  on  the  edges  that  can  simultaneously  transmit 
data,  capturing  the  communication  constraints  of  the  channel  model.  This  approach 
is  taken  in  [35],  building  on  results  on  achievable  rates  in  wireline  graphs  by  [31].  For 
protocol  models  1  and  2,  this  yields  a  computable  characterization  of  the  capacity 
scaling  for  a  fairly  general  set  of  traffic  models. 

The  situation  is  more  complicated  under  the  Gaussian  channel  models.  Some 
results  on  the  scaling  of  the  transport  capacity  (i.e.,  the  maximum  achievable  rate- 
distance  product  summed  over  all  source-destination  pairs)  under  arbitrary  node 
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placement  are  known.  In  [48],  it  is  shown  that  under  such  node  placement  and  using  a 
Gaussian  channel  model  without  fading,  the  transport  capacity  is  upper  bounded  by 
0{n)  for  a  >  6.  For  Gaussian  fading  channels,  the  same  behavior  was  shown  to  hold 
for  a  >  6  in  [51].  Under  both  Gaussian  channel  models  the  same  0{n )  upper  bound 
on  the  transport  capacity  was  argued  to  hold  for  a  >  5  in  [21],  for  a  >  4.5  in  [4],  and 
for  a  >  4  in  [50].  Matching  lower  bounds  are,  however,  usually  only  available  under 
stricter  conditions  on  the  node  placement.  In  [51],  it  is  shown  that  the  transport 
capacity  is  also  lower  bounded  by  Q(n)  for  any  a  >  2  if  the  node  placement  is  such 
that  for  every  node  at  least  one  other  node  is  within  distance  ©(1).  The  unicast 
traffic  that  achieves  this  lower  bound  pairs  each  node  with  its  nearest  neighbor  into 
a  source-destination  pair,  and  all  these  n  pairs  communicate  at  equal  rate. 

1.2.3  Traffic  Heterogeneity 

As  mentioned  in  the  previous  section,  under  protocol  models  1  or  2,  the  wireless  net¬ 
work  can  be  transformed  into  an  equivalent  wireline  graph.  This  is  used  in  [35]  to 
analyze  more  general  traffic  patterns.  The  authors  consider  product  multicommodity 
flows,  in  which  each  source-destination  pair  ( u ,  v )  wants  to  communicate  at  rate  7ru7r„, 
where  {7T„}  are  arbitrary  nonnegative  numbers.  For  such  traffic  patterns,  achievable 
rates  scale  like  the  conductance  of  the  equivalent  wireline  graph  [35].  Another  ap¬ 
proach  is  to  consider  the  transport  capacity  of  the  wireless  network.  The  transport 
capacity  upper  bounds  every  achievable  rate-distance  product  summed  over  all  source- 
destination  pairs.  As  such  it  provides  an  upper  bound  on  the  transport  rate  for  any 
achievable  unicast  traffic  matrix.  In  other  words,  the  transport  capacity  provides  a 
hyperplane  that  contains  the  capacity  region  and  origin  on  the  same  side.  Through  a 
repeated  application  of  this  transport  capacity  bound  at  different  scales,  [42,43]  ob¬ 
tained  an  implicit  characterization  of  the  unicast  capacity  region  under  a  simplified 
version  of  protocol  model  1.  Achievability  is  shown  in  [43]  using  a  localized  variant 
of  the  two-phase  Valiant- Brebner  routing  scheme  developed  in  [47]. 

For  the  Gaussian  channel  models,  asymptotic  upper  bounds  on  the  transport  ca¬ 
pacity  were  obtained  in  [21,48,50,51].  However,  as  was  discussed  in  the  last  paragraph, 
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the  transport  capacity  provides  only  partial  information  about  the  unicast  capacity  re¬ 
gion.  Generalized  transport  capacities,  in  which  the  rate  between  a  source-destination 
pair  at  distance  r  is  weighted  by  f(r)  for  some  function  /  are  analyzed  in  [4],  These 
generalized  transport  capacities  provide  tighter  outer  bounds  on  the  unicast  capacity 
region. 

1.2.4  Service  Heterogeneity 

So  far,  we  have  discussed  only  unicast  traffic.  More  general  service  requirements 
(such  as  broadcast,  multicast,  or  caching  traffic)  have  recently  attracted  attention. 
In  [46],  broadcasting  under  (a  simplified  version  of)  protocol  model  1  is  studied.  It  is 
shown  that  under  random  node  placement  the  maximal  per-node  rate  at  which  every 
node  can  simultaneously  broadcast  information  in  the  network  is  upper  bounded 
by  0(n_1).  The  same  problem  is  analyzed  under  protocol  model  2  in  [52],  and  it  is 
shown  that  the  maximal  achievable  per-node  rate  scales  like  ©(n^1  log_“^2(n)).  More 
general  broadcast  traffic,  in  which  each  node  broadcasts  data  at  different  (possibly 
zero)  rates,  have  been  studied  in  [23,24],  where  it  is  shown  that  general  broadcast 
traffic  is  achievable  if  and  only  if  its  sum  rate  scales  like  ©(l)  for  protocol  model  1  or 
like  ©(  log_Q/,2(n))  for  protocol  model  2.  In  other  words,  the  only  relevant  quantity 
when  analyzing  broadcast  traffic  is  the  sum  rate.  This  is  because  the  broadcast 
requirement  induces  a  uniform  received  traffic  pattern,  even  if  the  transmitted  traffic 
pattern  is  not  (i.e. ,  all  nodes  are  required  to  receive  information  at  the  same  rate).  An 
information-theoretic  approach  to  the  problem  was  taken  in  [40]  to  analyze  broadcast 
from  a  single  source  under  random  node  placement  and  assuming  a  Gaussian  fading 
channel  model.  The  maximal  achievable  broadcast  rate  for  the  source  is  shown  to 
be  upper  bounded  by  0(loglog(n))  and  lower  bounded  by  fl(l).  Achievability  (i.e., 
the  lower  bound)  is  proved  using  a  cooperative  multistage  scheme.  In  the  first  stage, 
the  message  is  transmitted  by  the  source.  In  the  second  stage,  nodes  that  were 
able  to  decode  the  sent  message  successfully,  cooperatively  retransmit  the  message. 
The  scheme  continues  in  the  same  fashion  until  all  nodes  have  correctly  decoded  the 
message.  Similar  cooperative  schemes  for  broadcast  over  Gaussian  fading  channels 
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have  also  been  studied  in  [19,26]. 

The  analysis  of  multicast  traffic  is  considerably  more  difficult.  For  random  node 
placement  and  protocol  model  1,  the  maximal  uniformly  achievable  per-node  rate  for 
multicast  from  nP  (with  (3  G  (0, 1))  randomly  selected  source  nodes  to  the  remaining 
n1_/3  nodes  in  the  network  has  been  shown  to  scale  like  © ((n/3 log(n))”1/2)  in  [39]. 
Under  the  same  assumptions  (but  with  a  simplified  variant  of  protocol  model  1),  it 
was  shown  in  [33]  that  when  each  node  wants  to  multicast  at  uniform  rate  to  rh3  (with 
f3  G  (0, 1))  randomly  chosen  destinations,  the  maximal  achievable  per-node  rate  scales 
like  @((n1+/i  log(n))-1/2) .  This  was  subsequently  generalized  to  protocol  model  2  by 
[25] ,  where  it  is  shown  that  under  the  same  traffic  and  node  placement  assumptions  as 
in  [33],  the  maximal  achievable  per  node-rate  scales  like  @((n1+/3)-1/2).  Achievability 
in  [25]  is  shown  using  the  scheme  of  [10],  and  the  same  log_1,/2(n)  gap  can  be  observed 
between  the  results  for  protocol  models  1  and  2,  just  as  in  the  unicast  case.  To  the 
best  of  our  knowledge,  the  scaling  of  achievable  multicast  rates  has  not  been  studied 
from  an  information-theoretic  point  of  view  using  either  of  the  Gaussian  channel 
models. 

The  analysis  of  caching  traffic  can  be  separated  into  two  distinct  problems.  In 
the  cache  selection  problem,  we  are  given  a  set  of  caches  and  are  interested  in  op¬ 
timally  selecting  caches  for  each  destination  node  and  the  resulting  performance  of 
the  network.  In  the  cache  placement  problem,  we  are  interested  in  optimally  placing 
the  caches  in  order  to  maximize  the  performance  of  the  network.  Most  of  the  prior 
work  on  caching  focuses  on  the  second  problem  and  sidesteps  the  first  one  by  making 
two  assumptions.  First,  the  wireless  network  is  modeled  by  a  (possibly  capacitated) 
graph,  and  second,  each  destination  node  requests  the  entire  message  from  the  closest 
(with  respect  to  the  graph  distance)  node.  For  arbitrary  graphs,  the  cache  placement 
problem  can  then  be  formulated  as  a  variant  of  the  classical  facility  location  problem 
(see,  e.g.,  [6,29]  and  references  therein).  In  the  context  of  wireless  networks,  this 
problem  has  been  studied  in  [5,20,27,36,44,45],  with  the  wireless  network  modeled 
as  a  graph  induced  by  a  simplified  version  of  protocol  model  1.  More  precisely,  con¬ 
stant  factor  approximation  algorithms  for  optimal  cache  placement  for  one  message 
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under  different  communication  constraints  are  proposed  in  [36,44],  Constant  factor 
approximation  algorithms  for  multiple  messages  under  different  memory  constraints 
are  proposed  in  [27,45],  Scaling  results  for  the  cache  placement  problem  are  presented 
in  [20],  which  derives  asymptotically  optimal  cache  densities  assuming  random  node 
placement  and  uniform  traffic,  and  in  [5],  which  analyzes  the  resulting  scaling  of 
achievable  rates.  As  mentioned  before,  the  results  on  caching  traffic  surveyed  in  this 
paragraph  model  the  wireless  network  as  a  graph  and  assume  nearest-neighbor  cache 
selection.  Hence  they  address  only  the  cache  placement  problem  while  avoiding  the 
cache  selection  problem. 

Caching  in  wireless  networks  has  not  been  directly  considered  in  the  information 
theory  literature.  However,  it  can  be  seen  that  the  cache  selection  problem  is  a 
special  case  of  the  problem  of  communicating  correlated  sources  over  a  noisy  network. 
Indeed,  we  can  consider  that  each  cache  has  an  identical  message  to  send  to  the  same 
destination.  The  more  general  problem  of  transmitting  correlated  sources  over  noisy 
networks  has  received  considerable  attention.  Unlike  the  situation  with  point-to-point 
communication,  for  network  communication  problems  source-channel  separation  does 
not  hold  in  general  [8].  Hence,  the  problem  of  source  and  channel  coding  have  to  be 
considered  jointly.  While  for  some  special  cases  optimal  communication  strategies 
for  transmitting  correlated  sources  over  a  noisy  network  are  known  (for  example, 
broadcast  from  a  single  source  with  independent  network  links  [7,17]),  the  general 
problem  is  unsolved. 


1.3  Thesis  Outline 

This  thesis  considers  the  impact  of  different  heterogeneities  on  achievable  rates  in  a 
wireless  network.  Throughout,  we  assume  the  Gaussian  fading  channel  model.  Our 
treatment  is  information  theoretic  and  hence  allows  claims  to  be  made  about  the 
performance  of  wireless  networks  under  any  communication  scheme  consistent  with 
the  model. 
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1.3.1  Location  Heterogeneity 


As  mentioned  earlier,  the  standard  homogeneity  assumption  for  the  location  of  nodes 
is  that  they  are  placed  independently  and  uniformly  at  random  on  a  square  of  area  n. 
However,  in  many  situations  this  might  not  be  a  good  model  of  reality.  A  more  general 
assumption  is  to  allow  for  arbitrary  node  placement  with  a  constant  (independent  of 
n)  minimum  separation  between  nodes.  We  adopt  this  model  to  study  the  effect 
of  location  heterogeneity  on  the  scaling  of  achievable  rates.  To  study  this  effect  in 
isolation,  we  keep  the  homogeneity  assumptions  for  traffic  and  service  requirements, 
i.e.,  we  assume  unicast  traffic  induced  by  random  source-destination  pairing  with 
uniform  rate. 

Several  complications  arise  due  to  the  introduction  of  location  heterogeneity.  As 
we  have  seen  in  Section  1.2.1,  in  the  homogeneous  case  the  optimal  communica¬ 
tion  scheme  depends  crucially  on  the  path-loss  exponent  a:  For  a  >  3,  multi-hop 
communication  is  order  optimal,  whereas  for  a  G  (2,  3]  hierarchical  cooperative  com¬ 
munication  is  order  optimal.  The  first  complication  is  that  under  arbitrary  node 
placement  these  schemes  might  either  be  clearly  suboptimal  or  not  even  be  implc- 
mentable  at  all.  As  an  example,  consider  the  two  node  placements  in  Figure  1-1.  The 
left  node  placement  shows  infeasibility  of  hierarchical  cooperation  under  arbitrary 
node  placement.  Recall  from  Section  1.2.1  that  the  hierarchical  cooperation  scheme 
operates  by  setting  up  distributed  MIMO  transmitter  and  receiver  clusters  at  increas¬ 
ingly  bigger  scales.  In  other  words,  the  neighbors  of  each  source  help  transmitting, 
and  the  neighbors  of  each  destination  help  receiving  the  message.  Consider  now  the 
source-destination  pair  (u,w)  in  the  left  node  placement  in  Figure  1-1.  These  nodes 
are  both  isolated,  i.e.,  they  have  no  immediate  neighbors.  Hence  neither  the  trans¬ 
mitter  MIMO  cluster  nor  the  receiver  MIMO  cluster  can  be  effectively  constructed. 
The  right  node  placement  shows  suboptimality  of  multi-hop  communication.  In  this 
example,  half  of  the  nodes  are  placed  on  the  left  of  the  square  area,  the  other  half  on 
the  right.  The  gap  between  these  two  node  clusters  is  of  order  Consider  the 

source-destination  pair  (u,  w )  in  this  figure.  For  a  multi-hop  communication  scheme, 
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Figure  1-1:  Example  node  placements  showing  the  infeasibility  of  hierarchical  cooper¬ 
ation  (left)  and  the  suboptimality  of  multi-hop  communication  (right)  under  arbitrary 
node  placement. 

one  of  the  hops  will  have  to  cross  the  gap  between  the  two  node  clusters.  Since  this 
gap  is  large,  this  hop  will  only  be  able  to  carry  data  at  low  data  rates.  More  precisely, 
for  a  hop  of  size  ©(y^),  the  largest  data  rate  achievable  scales  like  ©(n-Q/2).  Since 
under  random  source-destination  pairing  a  constant  fraction  of  nodes  will  have  to 
communicate  across  this  gap4,  the  maximal  uniformly  achievable  per-node  rate  un¬ 
der  multi-hop  communication  is  at  most  0(n_Q/2)  —  much  worse  than  the  flfVi-1/2) 
scaling  achievable  with  multi-hop  under  random  node  placement,  especially  for  large 
a. 

To  address  the  infeasibility  of  the  hierarchical  cooperative  communication  scheme 
under  arbitrary  node  placement,  we  propose  a  different  hierarchical  scheme,  which  we 
call  hierarchical  relaying.  We  show  that  this  scheme  achieves  the  same5 
scaling  of  the  per-node  rate  as  hierarchical  cooperation,  but  requires  no  uniformity 
in  the  node  placement.  In  particular,  this  scheme  can  be  successfully  applied  to 
the  node  placement  in  the  left  of  Figure  1-1.  We  also  show  that  for  a  E  (2,3] 
the  same6  0(  log6(n)n1-“/2)  upper  bound  on  the  per-node  rate  (proved  in  [38]  for 

4For  this  argument,  it  is  enough  if  even  a  single  source-destination  pair  has  to  cross  the  gap. 

5Note  that  the  scaling  of  the  lower  bound  obtained  here  is  actually  slightly  better,  i.e. ,  n1-“/2-o(d) 
here  compared  to  fi(n1_a/2_£)  for  arbitrary  small,  but  constant,  e  >  0  in  [38]. 

6Again,  the  scaling  of  the  upper  bound  obtained  here  is  slightly  better,  i.e.,  0(  log6(n)n1_a/2) 
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the  homogeneous  case)  is  also  valid  under  arbitrary  node  placement.  Together,  this 
answers  the  question  of  scaling  of  the  throughput  capacity  under  arbitrary  node 
placement  for  the  low  path-loss  exponent  regime  a  G  (2,3]. 

As  we  argued  in  the  last  paragraph,  for  a  G  (2,  3]  the  node  placement  has  no  im¬ 
pact  on  the  scaling  performance.  The  situation  is  markedly  different  for  large  path-loss 
exponent  a  >  3.  We  introduce  a  regularity  parameter,  measuring  on  a  coarse  level  the 
uniformity  of  the  node  placement.  We  show  how  the  scaling  of  throughput  capacity 
depends  on  this  regularity  parameter.  The  proposed  order  optimal  communication 
scheme  smoothly  “interpolates”  from  multi-hop  communication  (which  is  order  opti¬ 
mal  under  uniform  node  placement)  to  hierarchical  relaying  (which  is  order  optimal 
under  completely  irregular  node  placement)  depending  on  this  regularity  parameter. 
As  an  example,  we  show  that  for  the  node  placement  on  the  right  in  Figure  1-1,  the 
order  optimal  communication  scheme  is  hierarchical  relaying,  achieving  a  per-node 
rate  of  nWQ/2-0d).  This  contrasts  with  the  performance  of  multi-hop  communication, 
which  is  yields  a  per-node  rate  of  at  most  0(n_Q,/2). 

As  mentioned  in  the  introduction,  the  common  approach  to  deal  with  hetero¬ 
geneities  consists  of  identifying  the  underlying  “coarse  structure”  of  the  network, 
capturing  the  essential  parts  of  the  heterogeneity.  The  coarse  structure  of  the  wire¬ 
less  network  in  this  case  (for  a  >  3)  is  a  wireline  noiseless  grid  graph  where  each  node 
in  the  grid  corresponds  to  a  cluster  of  nodes  in  the  wireless  network  with  cluster  size 
depending  on  the  regularity  of  the  node  placement.  This  coarse  structure  explicitly 
captures  the  amount  of  cooperation  that  is  required  as  a  function  of  the  regularity  of 
the  node  placement. 

Location  heterogeneity  is  discussed  in  detail  in  Chapter  3. 

1.3.2  Traffic  Heterogeneity 

As  was  discussed  in  Section  1.1,  the  standard  homogeneity  assumption  for  traffic 
generation  is  that  each  node  is  source  exactly  once  and  wants  to  transmit  data  at 
uniform  rate  to  a  destination  node  chosen  uniformly  at  random  from  among  the  other 
here  compared  to  0(rW“/2+e)  for  any  (constant)  e  >  0  in  [38]. 
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nodes.  This  kind  of  traffic  pattern  has  several  problematic  characteristics.  The  first 
one  is  uniformity  of  traffic  (i.e.,  all  source-destination  pairs  want  to  communicate  at 
the  same  rate).  Most  traffic  patterns  observed  in  large  networks  (say  the  Internet) 
are  quite  different,  in  that  they  have  a  large  number  of  users  generating  little  traffic, 
and  a  small  number  of  users  generating  a  lot  of  traffic.  Traffic  variations  of  this  kind 
are  not  captured  by  the  homogeneous  traffic  assumption.  The  second  characteristic  of 
the  homogeneous  traffic  assumption  is  that,  since  each  node  chooses  a  destination  at 
random,  most  source-destination  pairs  will  be  at  a  distance  of  ©(y7^).  This  is  again 
unlike  the  situation  in  actual  networks  where  communication  patterns  are  likely  to 
be  more  localized.  The  third  characteristic  of  such  traffic  patterns  is  that  each  node 
is  source  exactly  once  and  is  destination  at  most  a  few  times.  Situations  in  which  we 
have  a  server  that  needs  to  transmit  data  to  many  nodes,  or  a  user  downloading  data 
from  many  other  nodes  cannot  be  accommodated  under  this  assumption. 

To  overcome  these  limitations,  we  turn  to  general  unicast  traffic.  In  other  words, 
we  are  interested  in  the  entire  unicast  capacity  region  Auc(n)  C  M”xn.  To  study 
the  effect  of  traffic  heterogeneity  in  isolation,  we  assume  random  node  placement. 
As  always,  we  assume  a  Gaussian  fading  channel  model.  While  outer  bounds  on 
the  unicast  capacity  region  Auc(n)  can  be  derived  from  results  on  transport  capacity 
reviewed  in  Section  1.2.2,  these  bounds  are  quite  simple  in  that  they  only  provide 
one  hyperplane  containing  the  capacity  region  and  the  origin  on  one  side,  and  they 
do  not  provide  a  scaling  characterization  of  Auc(n).  The  situation  is  worse  for  inner 
bounds,  where  except  for  some  special  points  (as  the  one  resulting  from  homogeneous 
traffic)  not  much  is  known. 

In  this  thesis,  we  find  inner  and  outer  bounds  on  the  rr-dimensiorial  unicast 
capacity  region.  These  bounds  behave  asymptotically  in  the  same  way  along  at  least 
n2  —  n  of  the  total  n2  dimensions  for  a  E  (2,  5],  and  for  all  n2  dimensions  for  a  >  5. 
Hence  they  determine  the  scaling  behavior  of  either  most  (for  a  E  (2,5])  or  all  (for 
a  >  5)  of  the  unicast  capacity  region  Auc(n).  More  precisely,  we  define  two  sets 
A^c(n),  A2C(n)  C  M"xn,  coinciding  along  at  least  n2  —  n  dimensions.  We  show  that 
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for  a  £  (2,  5], 


n  °(1)A!jJC(n)  C  Auc(n)  C  0(log6(n))A2C(n), 

and  for  a  >  5, 

n_o(1)AljJC(n)  C  Auc(n)  C  0(log6(n))A^c(n). 

In  words,  for  a  >  5,  if  we  shrink  A^c(n)  by  a  small  (in  the  scaling  sense)  factor,  we 
obtain  an  inner  bound  to  the  capacity  region.  If  we  grow  A^c(n)  by  a  small  (again  in 
the  scaling  sense)  factor,  we  obtain  an  outer  bound.  Thus  A^c(n)  scales  like  Auc(n). 
The  same  statement  is  true  for  a  £  (2,  5]  for  n2  —  n  out  of  n2  dimensions  of  Auc(n). 
This  characterization  allows  for  analysis  of  the  asymptotic  behavior  of  the  wireless 
network  under  general  unicast  traffic. 

Note  that  the  set  Auc(n)  is  large  (n2  dimensional)  and  could  in  general  be  rather 
difficult  to  describe.  Indeed,  descriptions  of  feasible  rates  are  usually  given  in  terms 
of  cut-set  bounds  that  constrain  the  sum  rate  of  subsets  of  nodes.  Potentially  there 
are  2n  such  subsets,  which  would  result  in  a  very  complicated  characterization  of 
Auc(n).  However,  we  show  that  the  bounds  A^c(n)  and  A2C(n)  can  be  described 
approximately  using  only  2 n  cuts.  More  precisely,  A)JC(n)  and  A^n)  are  polytopes 
with  at  most  2 n  faces,  each  one  of  them  corresponding  to  some  cut-set  bound.  This 
shows  that,  even  though  the  description  complexity  of  Auc(n)  is  likely  to  be  of  order 
@(2n),  the  description  complexity  of  its  approximation  A^n)  and  A2C(n)  is  only  of 
order  0(n)  —  a  logarithmic  reduction  in  description  complexity! 

The  coarse  structure  capturing  traffic  heterogeneity  is  a  noiseless  wireline  tree 
graph.  The  leaves  of  this  tree  correspond  to  the  nodes  in  the  wireless  network,  in¬ 
termediate  nodes  in  this  tree  correspond  to  various  levels  of  cooperation  within  the 
wireless  network.  This  coarse  tree  structure  makes  explicit  the  interaction  between 
traffic  demands  and  the  amount  of  cooperation  in  the  wireless  network  that  is  needed 
to  satisfy  those  demands. 

Traffic  heterogeneity  is  discussed  in  detail  in  Chapter  4. 
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1.3.3  Service  Heterogeneity 


While  unicast  traffic  as  discussed  in  the  last  two  sections  describes  a  broad  class  of 
traffic,  in  several  applications  multicast  is  the  dominating  mode  of  communication. 
In  multicast  traffic  each  source  node  wants  to  transmit  its  information  to  a  group  of 
destinations.  Here  we  are  interested  in  general  multicast  traffic,  i.e.,  the  multicast 
capacity  region  AMC(n)  C  M”x2  .  As  in  the  last  section,  we  assume  random  node 
placement.  As  mentioned  in  Section  1.2.4,  so  far  the  only  results  available  for  multi¬ 
cast  traffic  are  under  a  protocol  model  assumption  and  for  homogeneous  traffic  (i.e., 
each  node  is  source  once  and  wants  to  communicate  to  the  same  number  of  randomly 
chosen  destinations  at  uniform  rate). 

In  this  thesis,  we  find  inner  and  outer  bounds  on  the  n  x  2n-dimensional  multi¬ 
cast  capacity  region  AMC(n)  under  a  Gaussian  fading  channel  model.  These  bounds 
coincide  up  to  scaling  for  n2n  —  n  out  of  n2n  dimensions  for  a  G  (2,  5]  and  for  all 
n2n  dimensions  for  a  >  5.  Hence  they  determine  the  scaling  behavior  of  either  most 
(for  a  G  (2,  5])  or  all  (for  a  >  5)  of  the  multicast  capacity  region.  More  precisely,  we 
define  two  sets  Aj/ic(n),  A^ic(n)  C  M”x2  ,  coinciding  along  at  least  n2n—n  dimensions. 
We  show  that  for  a  G  (2,  5], 

n-°(i)Afc(„)  c  AMC(n)  C  0(log6(n))A2MC(n), 

and  for  a  >  5, 

n-°(i)A“c(„)  c  AMC(n)  C  0(log6(n))A^c(n), 

For  a  >  5,  this  provides  a  scaling  characterization  of  the  entire  multicast  capacity 
region,  and  the  same  statement  holds  for  a  G  (2,  5]  along  at  least  n2n  —  n  dimensions. 
As  before,  we  show  that  the  approximations  Aj/ic(n)  and  A^/lc(n)  of  the  multicast 
capacity  region  AMC(n)  are  described  completely  by  considering  only  2 n  out  of  2n 
possible  cut-set  bounds.  We  again  make  use  of  the  coarse  structure  of  the  wireless 
network  developed  for  traffic  heterogeneity  (see  Section  1.3.2). 

In  a  similar  manner,  one  can  analyze  the  effect  of  caching  traffic,  in  which  a 
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destination  node  can  obtain  the  same  information  from  a  group  of  caches.  In  other 
words,  we  are  interested  in  the  caching  capacity  region  ACA(n)  C  xn.  We  define  a 
set  ACA(n)  C  M+"xn  such  that  for  a  >  6, 

n~°^ ACA(n)  C  ACA(n)  C  0(log6(n))ACA(n), 

providing  a  scaling  characterization  of  the  complete  caching  capacity  region  in  the 
large  path-loss  regime.  Unlike  the  case  for  AljJC(n)  and  Aj/ic(n),  the  caching  capacity 
region  cannot  be  accurately  approximated  by  fewer  than  2n  cut-set  bounds.  However, 
we  show  that  ACA(n)  is  nevertheless  a  manageable  expression,  in  that  approximate 
achievability  of  caching  traffic  can  be  evaluated  in  polynomial  time  in  the  description 
length  of  caching  traffic  matrix  ACA  (i.e.,  ACA  G  A CA(n)  can  be  checked  efficiently  even 
for  large  networks). 

The  characterization  of  the  caching  capacity  region  ACA(n)  provides  a  complete 
(approximate)  solution  to  the  cache  selection  problem  mentioned  in  Section  1.2.4  for 
the  high  path-loss  regime  a  >  6.  We  hope  that  this  characterization  can  be  used 
to  subsequently  optimize  over  the  cache  location,  which  would  then  also  provide  an 
answer  to  the  cache  placement  problem. 

Service  heterogeneity  is  discussed  in  detail  in  Chapters  5  and  6. 


Chapter  2 


Network  Model  and  Notation 


In  this  chapter,  we  formally  define  the  network  and  channel  models,  and  give  a  rig¬ 
orous  definition  of  the  various  capacity  regions  mentioned  in  Chapter  1. 

Section  2.1  introduces  some  notation  used  throughout  this  thesis.  Section  2.2 
introduces  the  network  and  channel  models.  Section  2.3  formally  defines  the  unicast, 
multicast,  and  caching  capacity  regions. 

2.1  Notation  and  Conventions 

We  use  Knuth’s  asymptotic  notation.  For  functions  /,  g  :  N  — >  K+,  we  say  that 

•  f{n)  =  0(g(n ))  if  limsup)WOO  ^  <  oo, 

•  f(n)  =  M(g(n))  if  g(n)  =  0(f(n)), 

•  f(n)  =  0(g(n))  if  f(n)  =  0(g(n))  and  f(n)  =  Cl{g{n)), 

•  f(n)  =  o(g(n))  if  linx,^  ^  =  0. 

We  use  the  following  conventions:  K,  for  different  i,  and  K,  K,  . . . ,  denote 
strictly  positive  finite  constants  independent  of  n.  Vectors  and  matrices  are  denoted 
by  boldface  whenever  the  vector  or  matrix  structure  is  of  importance.  We  denote 
by  (-)t  conjugate  transpose.  To  simplify  notation,  we  assume,  when  necessary,  that 
large  real  numbers  are  integers  and  omit  [•]  and  ['J  operators.  For  the  same  reason, 
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we  also  suppress  dependence  on  n  within  proofs  whenever  this  dependence  is  clear 
from  the  context,  and  we  assume  that  n  >  2.  Throughout,  we  use  log(-)  and  ln(-)  for 
logarithms  with  respect  to  base  2  and  e,  respectively. 


2.2  Network  Model 

Consider  the  square 

A{n)  =  [0,  yj n\ 2 

of  area  n,  and  let  V(n)  C  A(n)  be  a  set  of  |V(n)|  =  n  nodes  on  A(n).  Each  node 
v  G  V ( n )  represents  a  wireless  device.  We  make  one  of  the  two  following  assumptions 
on  the  node  placement.  For  random  node  placement,  we  assume  that  the  n  nodes 
V(n)  are  placed  uniformly  at  random  in  an  independent  and  identically  distributed 
(i.i.d.)  fashion  on  the  area  A(n).  For  arbitrary  node  placement,  we  make  no  proba¬ 
bilistic  assumptions,  but  rather  assume  that  V (n)  is  an  arbitrary  deterministic  node 
placement  such  that  rUjV  >  rmin,  where  rUjV  is  the  Euclidean  distance  between  u  and 
v,  and  where  rmin  >  0  is  a  constant  independent  of  n.  Note  that,  in  either  case,  the 
node  placement  is  fixed  as  a  function  of  time.  In  other  words,  we  assume  that  the 
change  in  location  of  the  nodes  in  the  network  is  slow  enough  with  respect  to  the 
communication  delays.  We  also  assume  that  all  node  locations  are  known  throughout 
the  entire  network. 

We  use  the  following  channel  model.  The  (sampled)  received  signal  at  node  v  and 
time  t  is 

Vv(t)=  K,v{t)Xu(t)  +  Zyit)  (2.1) 

u£pn)\{t)} 

for  all  v  G  V(n),t  G  N,  where  {xu(t)}Utt  is  the  (sampled)  signal  sent  by  the  nodes 
in  V(n).  Here  {zv(t)}vj  are  i.i.d.  circularly  symmetric  complex  Gaussian  random 
variables  with  mean  0  and  variance  1,  and 

hu,v{t)  =  r^"/2exp(v/rl6'u,t,(t)),  (2.2) 
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for  -path-loss  exponent  a  >  2.  As  a  function  of  the  nodes  u,v  G  V(n),  the  phases 
{du,v(t)}u,v  are  assumed  to  be  i.i.d.  with  uniform  distribution  on  [0,  2n).  As  a  function 
of  time  t,  we  either  assume  that  {0UtV(t)}t  is  stationary  and  ergodic,  which  is  called 
fast  fading  in  the  following,  or  we  assume  that  {0UtV(t)}t  is  constant,  which  is  called 
slow  fading  in  the  following.  In  either  case,  we  assume  full  channel  state  information 
(CSI)  is  available  at  all  nodes,  i.e.,  each  node  knows  all  {0UjV(t)}UtV  at  time  t.  We 
also  impose  an  average  power  constraint  of  1  on  the  signal  {xu(t)}t  for  every  node 
u  G  V  (n). 

While  the  channel  model  used  is  quite  simple,  it  does  capture  several  effects  arising 
in  wireless  channels.  The  phase  shifts  {0U}V(t)}u^v  model  the  effect  of  small-scale 
movements  of  the  nodes  (on  the  order  of  the  wavelength).  The  i.i.d.  assumption  of 
the  phase  shifts  is  justified  by  the  large  (again,  relative  to  the  wavelength)  separation 
of  the  nodes  (but  see  the  comments  on  the  validity  of  the  model  for  very  large  n 
and  a  G  (2,3)  below).  The  ru^2  term  models  power  decay  over  larger  scales,  and 
is  assumed  not  to  be  affected  by  the  small-scale  movement.  Since  the  network  is 
assumed  to  be  static,  the  rff/2  terms  do  not  vary  with  time. 

The  full  CSI  assumption  made  above  is  quite  strong,  and  is  worth  commenting 
on.  First,  we  make  the  full  CSI  assumption  in  all  the  converse  results  in  this  thesis. 
This  implies  that  all  the  converses  also  hold  under  weaker  assumptions  on  the  CSI, 
and  hence  are  valid  as  well  under  a  wide  variety  of  more  realistic  assumptions  on 
the  availability  of  side  information.  Second,  all  achievability  results  presented  in  this 
thesis  can  be  shown  to  hold  under  weaker  assumptions  on  the  availability  of  CSI.  In 
all  cases,  a  2-bit  quantization  of  the  channel  state  {0UjV{t)}UjV  available  at  all  nodes  in 
V (■ n )  at  time  t  is  sufficient  to  obtain  the  same  scaling  behavior.  Moreover,  for  random 
node  placement  and  a  G  (2,3],  causal  quantized  receiver  only  CSI  is  sufficient.  And 
for  random  node  placement  and  a  >  3  no  CSI  is  needed.  We  comment  on  the  necessity 
of  CSI  in  more  detail  following  the  proofs  of  the  scaling  results  in  subsequent  chapters. 

We  should  also  point  out  that  recent  results  [11]  suggest  that,  under  certain  as¬ 
sumptions  on  the  location  of  scattering  elements,  for  a  G  (2,  3)  and  very  large  values 
of  n,  the  channel  model  used  here  (in  particular,  the  i.i.d.  assumption  on  the  phases 
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{9u,v(t)}u,v  as  a  function  of  the  nodes  u,v  G  V(n))  might  yield  results  that  are  too 
optimistic.  However,  the  authors  show  in  [12]  that,  under  different  assumptions  on 
the  scatterers,  the  channel  model  used  here  is  still  valid  also  for  a  G  (2,  3)  and  very 
large  values  of  n.  This  indicates  that  the  issue  of  proper  channel  modelling  in  the  low 
path-loss  regime  for  very  large  networks  is  somewhat  delicate  and  requires  further 
investigation. 


2.3  Capacity  Regions 

A  traffic  matrix  A  G  M^”x2n  associates  with  each  pair  of  subsets  U,  W  C  V ( n )  of  nodes 
the  number  A u,w-  This  A u,w  is  to  be  understood  as  the  rate  at  which  the  nodes  in 
W  request  a  common  message  available  at  the  set  of  caches  U.  We  are  interested  in 
the  set  of  traffic  matrices  that  the  wireless  network  can  support.  The  collection  of  all 
such  supportable  traffic  matrices  will  be  called  the  capacity  region  A (n)  C  M2r,x2n  of 
the  wireless  network. 

We  now  make  the  definition  of  A (n)  formal.  Fix  a  traffic  matrix  A  G  x2  and 

(T) 

a  blocklength  TgN.  Let  the  message  rnTJ  be  uniformly  distributed  on 

{l,...,2TXu’w}. 

(rr\ 

We  assume  that  the  random  variables  \'muw}u,wcv(n)  are  independent.  Note  that 

(T) 

rrVjjfv  is  requested  by  all  destination  nodes  w  G  W  and  is  available  at  all  nodes  u  E  U . 

(T) 

Hence  node  u  has  access  to  all  messages  mf  such  that  u  G  U,  i.e., 

(X1) 

\mU,w}u,WCV(n):ueU- 

The  message  set  at  node  u  G  V (■ n )  is  then  defined  as  the  set  of  all  possible  values  of 
these  message  available  at  u: 

0  0  {!,•••,  2TXu’w}  \/u  G  V(n). 

UcV(n):u£UWcV(n) 


32 


An  encoder  of  blocklength  T  is  a  collection  of  functions 


xP(t)  :  M<p  x  [0,  2n)tn{n-1)  x  C'”1  -►  C  Vt  e  {1, . . .  T},  u  G  V(n), 

mapping  the  messages  available  at  u  (i.e. ,  satisfying  u  G  U),  the  channel 

states  {{6lU)„(s)}u^ey(n)}‘=1  up  to  time  t,  and  the  received  signals  {j/u(s)}g=i  at  node 
u  up  to  time  t  —  1  into  a  channel  input  x^f\t)  at  time  t.  We  impose  that  the  encoder 
satisfies  the  power  constraint 

|  ut)  E  (l4T)  0)  {{mu,w}i  {{^(s)}^ev(n)K=n  (s/«(s)}«=i)  |2) 

<1  Vu  G  V  (n) , 

with  expectation  with  respect  to  {mu,w},  {2/u(s)}s=i  and  { { (s) }«,-ue v'(n) }i=i -  A 
decoder  of  blocklength  T  is  a  collection  of  functions 

Vu,l,w  ■  CT  x  [0,  2vr)Tn(n-1)  x  Mf )  {l, . . . ,  2TA^}  VC/,  W  C  V{n)  :  w  E  W, 

mapping  the  received  signal  {yw(t)}J=1  at  node  w  up  to  time  T,  the  channel  states 
{{6u,v(t)}u,vev(n)}T=i  up  t°  time  T,  and  the  messages  available  at  node  w 

(i.e.,  satisfying  w  E  U)  into  an  estimate  of  the  message  Together,  an 

encoder  and  a  decoder  of  blocklength  T  form  a  coding  scheme  of  blocklength  T.  The 
probability  of  error  or  such  a  coding  scheme  is  defined  as 


p(T)  A 


max  rnaxP (‘Puww{{yw(t)}J=1,  {{6u,v(t)}u,v}J=1,  {m 

U,WcV(n)  weW  \  ’  ’ 


(T)  i  _ 
UWJU,W:w£U 


)# 


m 


C t ) 
u,w 


(T) 

In  words,  Pe  ;  is  the  average  probability  of  error  of  incorrect  decoding  of  the  message 

(T) 

mijw  maximized  over  all  possible  caches  U  and  destination  groups  W . 


A  traffic  matrix  A  €  IR2"*2 


is  said  to  be  achievable ,  if  there  exists  a  sequence  of 
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coding  schemes  of  blocklengths  T  G  N  such  that 


lim  P}1"1  =  0. 

T— >oo 

Finally,  the  capacity  region  A (n)  C  M+  x2  is  the  closure  of  the  set  of  all  achievable 
traffic  matrices. 

A  few  examples  will  illustrate  the  utility  of  defining  the  notion  of  the  capacity 
region  A(n)  in  such  generality.  These  examples  introduce  important  special  cases 
that  will  be  analyzed  throughout  this  thesis. 


Example  2.1.  ( Unicast ) 

A  unicast  traffic  matrix  Auc  G  R”xn  associates  with  each  node  pair  u,w  G  V (■ n )  a 
number  Aj^.  This  number  is  the  rate  at  which  the  source  node  u  wants  to  transmit 
information  to  the  destination  node  w.  Note  that  we  allow  the  same  node  u  to  be 
source  for  multiple  destinations  and  the  same  node  w  to  be  destination  for  multiple 
sources.  In  such  situations,  the  multiple  messages  at  u  are  assumed  to  be  independent 
(and  similarly  for  the  messages  from  multiple  sources  at  w). 

For  a  specific  example,  assume  n  =  4,  and  label  the  nodes  as  {ip}y=1  =  V(n). 
Assume  further  that  node  u\  needs  to  transmit  a  message  mUltU2  to  node  u2  at  rate  1 
bit  per  channel  use  and  an  independent  message  mUltU3  to  node  u3  at  rate  2  bits  per 
channel  use.  Node  u2  needs  to  transmit  a  message  mU2jU3  to  node  u3  at  rate  4  bits 
per  channel  use.  All  the  messages  mUltU2,  mUuU3,  mU2jU3  are  independent.  This  traffic 
pattern  can  be  described  by  a  unicast  traffic  matrix  Auc  G  R4x4  with  A^*  =  1, 

A,^3  =  2,  A£tt  =  4,  and  A^  =  0  otherwise.  Note  that  in  this  example  node  u  1 
is  source  for  two  (independent)  messages,  and  node  u3  is  destination  for  two  (again 
independent)  messages.  Node  W4  in  this  example  is  neither  source  nor  destination  for 
any  message,  and  can  be  understood  as  a  helper  node. 

Now,  for  each  unicast  traffic  matrix  Auc  G  M+Xn,  we  can  construct  a  traffic  matrix 
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A  G  R+"x2™  as 

{Aj^  if  U  —  {«},  W  =  {w}  for  some  u,w  G  V (n), 

0  else. 

A  unicast  traffic  matrix  Auc  is  achievable  if  the  corresponding  traffic  matrix  A  is.  The 
unicast  capacity  region  Auc(n)  C  M"XM  is  defined  as  the  closure  of  the  set  of  achievable 
unicast  traffic  matrices.  Note  that  the  unicast  capacity  region  is  the  subset  of  the 
capacity  region  arising  from  intersecting  A(n)  with  the  subspace  corresponding  to 
(U,  W)  pairs  of  the  form  ({«},  {u>})  for  some  u,w  G  V (n). 

The  notion  of  unicast  traffic  defined  in  the  last  paragraph  is  very  general.  Two 
special  cases  of  unicast  traffic  matrices  are,  however,  worth  mentioning. 

A  unicast  traffic  matrix  Auc  is  called  a  permutation  traffic  matrix  if  for  every 
node  u  G  V(n)  there  is  exactly  one  w  G  V(n)  \  {w}  such  that  Aj^y  >  0  and  exactly 
one  w  G  V (n)  \  {«}  such  that  Aj-^  >  0.  In  words,  for  a  permutation  unicast  traffic 
matrix,  every  node  is  source  and  destination  exactly  once.  A  permutation  unicast 
traffic  matrix  is  said  to  have  uniform  rate  if  for  all  u,w  G  V (n)  we  have  A^  =  {0, 1{ 
(i.e,  each  of  the  n  source-destination  pairs  wants  to  transmit  messages  at  rate  1).  For 
a  permutation  traffic  matrix  Auc  with  uniform  rate,  we  define  the  throughput  capacity 
p*{n)  as  the  largest  value  of  p{n)  such  that  p{n) Auc  is  achievable.  In  other  words 
p*{n )  is  the  largest  uniformly  achievable  per-node  rate.  For  ease  of  notation,  we  will 
often  just  refer  to  the  throughput  capacity  p*{n )  for  a  permutation  traffic  matrix  Auc 
without  explicit  mentioning  of  the  uniform  rate  requirement. 

A  unicast  traffic  matrix  Auc  is  called  a  random  source- destination  pairing  with 
uniform  rate  if  it  results  from  picking  for  each  node  «Gf  (n)  one  other  node  w  inde¬ 
pendently  and  uniformly  at  random  from  V(n)  \  {u}  and  setting  A^  =  1.  Random 
source-destination  pairings  with  uniform  rate  are  closely  related  to  permutation  traf¬ 
fic  with  uniform  rate  for  which  all  source-destination  pairs  are  at  a  distance  Q(y/n), 
and  for  scaling  purposes  the  two  are  equivalent.  0 

Example  2.2.  ( Multicast ) 


35 


A  multicast  traffic  matrix  AMC  G  M”x2  associates  with  each  node  u  €  V(n)  and 
subset  W  C  V(n)  a  number  Ajf^.  This  number  is  the  rate  at  which  the  source  node 
u  wants  to  multicast  (identical)  information  to  all  the  destination  nodes  w  G  W. 
Note  that  we  do  not  impose  that  a  source  node  u  multicasts  information  only  to 
one  group  of  destinations  W.  In  fact,  for  every  u  G  V(n)  there  could  be  two  (or 
more)  subsets  W,  W  C  V (n)  with  W  W  such  that  Aj^  >  0  and  A^~  >  0.  In 
such  a  situation,  the  messages  for  the  two  groups  of  destinations  are  assumed  to  be 
independent.  Similarly,  two  nodes  could  want  to  multicast  (independent)  messages 
to  the  same  set  of  destination  nodes. 

For  a  specific  example,  assume  again  n  =  4,  and  label  the  nodes  as  {ui}*=1  =  V (n). 
Assume  that  node  U\  needs  to  transmit  the  same  message  mUl:{U2jU3tU4}  to  all  nodes 
ui,U2,U3  at  a  rate  of  1  bit  per  channel  use  and  an  independent  message  mul^U2} 
to  only  node  2  at  rate  2  bits  per  channel  use.  Node  2  needs  to  transmit  a  mes¬ 
sage  rnU2^UltU3j  to  both  ui,u3  at  rate  4  bits  per  channel  use.  All  the  messages 
'mu1,{u2,u3,u4},  ^ui,{u2})  mu2,{u1,u3}  are  independent.  This  traffic  pattern  can  be  de¬ 
scribed  by  a  multicast  traffic  matrix  AMC  G  M+x16  with  A^1iC|,U2  U3  U4j  =  1,  A^C|((  )j  =  2, 
A^V  „3 1  =  4,  and  A^,r  =  0  otherwise.  Note  that  in  this  example  node  U\  is  source 
for  two  (independent)  multicast  messages,  and  node  u2  and  u3  are  destinations  for 
more  than  one  message.  The  message  mul){U2)U 3jtt4}  is  destined  for  the  all  the  nodes 
in  the  network,  and  can  hence  be  understood  as  a  broadcast  message.  The  message 
mUl,{u2}  is  only  destined  for  one  node,  and  can  hence  be  understood  as  a  private 
message. 

For  each  multicast  traffic  matrix  AMC  G  M”x2",  we  can  construct  a  traffic  matrix 
A  G  M™"  as 

j  A,^.  if  U  =  {«}  for  some  u  G  V(n), 

A  u,w  =  \ 

I  0  else. 

A  multicast  traffic  matrix  AMC  is  achievable  if  the  corresponding  traffic  matrix  A  is. 
The  multicast  capacity  region  AMC(n)  C  M”x2"  is  defined  as  the  closure  of  the  set 
of  achievable  unicast  traffic  matrices.  As  before,  the  multicast  capacity  region  is 
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the  subset  of  the  capacity  region  arising  from  intersecting  A(n)  with  the  subspace 
corresponding  to  (U,  W)  pairs  of  the  form  ({«},  W)  for  some  u  G  V{n). 

We  note  that  this  definition  of  multicast  is  very  general.  0 


Example  2.3.  ( Caching ) 

A  caching  traffic  matrix  ACA  G  xn  associates  with  each  node  w  G  V(n)  and 
subset  U  C  V(n)  a  number  A^,.  This  number  is  the  rate  at  which  the  destination 
node  w  requests  information  that  is  available  at  all  the  caches  u  E  U .  Note  that  we 
do  not  impose  that  a  destination  node  w  requests  information  from  only  one  group 
of  caches  U.  In  fact,  for  every  w  G  V(n)  there  could  be  two  (or  more)  subsets 
U,  U  C  V (■ n )  with  U  f  U  such  that  XfAw  >  0  and  A~auj  >  0.  In  such  a  situation, 
the  messages  for  the  two  groups  of  caches  are  assumed  to  be  independent.  Similarly, 
the  same  set  of  caches  can  hold  (independent)  messages  for  more  than  one  different 
destination  nodes.  For  example,  a  situation  where  parts  of  a  message  requested  by  a 
destination  node  w  is  available  at  caches  U  and  a  different  part  is  available  at  caches 
U  could  be  modeled  as  two  messages  (one  corresponding  to  each  part)  available  at  U 
and  I/,  respectively. 

For  a  specific  example  consider  again  {ui}f=1  =  V(n)  with  n  =  4.  Assume  that 
u i  requests  a  message  myU3tU4y:U1  available  at  the  caches  u3,  and  w4  at  rate  1  bit  per 
channel  use  and  an  independent  message  m{U3y,Ul  available  only  at  u3  at  a  rate  of  2 
bits  per  channel  use.  Node  U2  requests  a  message  nr{U3iU4j)U2  available  at  the  caches 
u3  and  m4  at  a  rate  of  4  bits  per  channel  use.  The  messages  m^U3jU4yjU1,  myU3yjU1,  and 
m{u3,u4},u 2  are  assumed  to  be  independent.  This  traffic  pattern  can  be  described  by 
a  caching  traffic  matrix  ACA  G  M^6x4  with  AfAW)Ui  =  1,  AfAa})Ui  =  2,  AfA3  U4}  u2  =  4, 
and  Xffj,  =  0  otherwise.  Note  that  in  this  example  node  u \  is  destination  for  two 
(independent)  caching  messages,  and  node  u3  and  w4  serve  as  caches  for  more  than 
one  message  (but  these  messages  are  assumed  independent). 

For  each  caching  traffic  matrix  ACA  G  xn,  we  can  construct  a  traffic  matrix 
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A  g  M™" 


as 


{A“  if  W  —  {w}  for  some  w  G  V(n), 

0  else. 

A  caching  traffic  matrix  ACA  is  achievable  if  the  corresponding  traffic  matrix  A  is. 
The  caching  capacity  region  A CA(n)  C  xn  is  defined  as  the  closure  of  the  set  of 
achievable  caching  traffic  matrices.  As  before,  the  caching  capacity  region  is  the  subset 
of  the  capacity  region  arising  from  intersecting  A(n)  with  the  subspace  corresponding 
to  (U,  W)  pairs  of  the  form  (U,  {w})  for  some  w  G  V(n). 

This  definition  of  caching  is  completely  general  in  terms  of  the  number  and  location 
of  caches  and  their  destinations  as  well  as  the  amounts  of  traffic  between  them. 
Moreover,  by  the  definition  of  achievability,  we  do  not  impose  that  for  a  particular 
(U,  w )  pair  one  cache  u  6  h  transmits  the  entire  requested  message  to  the  destination 
node  w.  Rather,  we  allow  all  caches  to  participate  in  the  transmission  of  the  message. 
Thus,  this  definition  of  caching  is  also  general  in  terms  of  cache  selection.  0 


We  note  that  the  definition  of  the  capacity  region  (and  hence  also  the  ones  for 
unicast,  multicast,  and  caching)  contain  several  trivial  dimensions.  These  are  the 
dimensions  corresponding  to  (U,  W)  pairs  such  that  either  W  C  U  with  W  ^  0,  or 
U  =  0,  or  W  =  0.  The  first  such  case  can  arise  in  unicast,  multicast,  and  caching  and 
corresponds  to  w  =  u,  W  =  {u},  and  w  G  U,  respectively.  The  second  case  arises 
only  in  caching.  The  third  case  arises  only  in  multicast.  We  now  analyze  these  three 
trivial  cases  in  more  detail. 

Consider  an  entry  A u,w  of  the  traffic  matrix  A  such  that  W  C  U.  Note  that 
the  decoder  g>\j^w  at  node  w  G  W  has  access  to  the  messages  w-weu-  -*-11 

(T) 

particular,  since  W  C  U  and  hence  w  G  U,  it  has  access  to  rnTj  {v ,  and  can  therefore 
easily  decode  this  message  by  simply  setting 

i.  {{0u,«(*)}u,«ev(n)}£= i,  {m^w}u,W:weu)  =  mu,w- 
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Since  this  is  true  for  every  w  G  W,  we  can  choose  A u,w  arbitrarily  large  and  still 
guarantee  successful  decoding.  Hence  the  capacity  region  A (n)  is  unbounded  along 
dimension  (U,  W)  for  W  C  U. 

Consider  now  an  entry  A u,w  °f  a  traffic  matrix  A  such  that  U  —  0.  Then  m^w  ^ 

(rr\  (t\  (t\ 

Mu  7  for  any  u  G  V(n),  and  therefore  no  encoder  xu  (t)  has  access  to  m\j^.  Hence 

(T)  (T) 

the  received  signal  at  any  decoder  (p\jww  for  w  G  W  is  independent  of  rrij-j  {v  and  the 
resulting  probability  of  error  will  be  approaching  one  as  T  — >  oo  unless  A u,w  —  0. 
Thus  the  capacity  region  A(n)  is  zero  along  dimension  (U,  W)  for  U  =  0. 

Finally,  consider  an  entry  Xu.w  of  a  traffic  matrix  A  such  that  W  —  0.  Then 
there  exists  no  decoder  (Pjj{yw  such  that  w  G  W,  and  therefore  we  can  choose  A u,w 
arbitrarily  large  without  affecting  the  probability  of  error.  Hence  the  capacity  region 
A (n)  is  unbounded  along  dimension  (U,  W)  for  W  —  0. 

While  the  capacity  region  A (n)  and  all  its  special  cases  have  certain  dimensions 
that  are  trivial,  these  are  only  very  few.  In  particular,  for  the  n  x  n  dimensional 
unicast  capacity  region  Auc(n)  only  n  dimensions  are  trivial,  for  the  n  x  2n  dimen¬ 
sional  multicast  capacity  region  AMC(n)  only  2 n  dimensions  are  trivial,  for  the  2”  x  n 
dimensional  caching  capacity  region  ACA(n)  only  n( 2n~1  +  1)  dimensions  are  trivial. 
In  other  words,  the  nontrivial  number  of  dimensions  of  the  unicast,  multicast,  and 
caching  capacity  regions  are  n(n  —  1),  n( 2n  —  2),  and  n(2"”1  —  1),  respectively.  Thus 
the  number  of  trivial  dimensions  is  negligible,  and  including  them  in  the  definition 
allows  to  simplify  notation  considerably. 

Note  that  the  capacity  region  A (n)  is  (in  most  cases,  see  below)  a  random  variable 
with  probabilistic  structure  determined  by  the  assumptions  on  the  node  placement 
and  the  fading  model.  More  precisely,  for  slow  fading  (in  which  the  channel  gains  are 
random  across  nodes,  but  constant  across  time),  A(n)  is  a  function  of  the  realization 
of  those  channel  gains.  In  contrast,  for  the  fast  fading  case  (in  which  the  channel 
gains  are  ergodic  across  time),  the  coding  scheme  can  average  out  any  short  time 
fluctuations  in  the  channel  gains,  and  hence  A (n)  depends  only  on  the  expected 
behavior  of  the  channel  gains  and  not  on  their  realization.  The  capacity  region 
A (n)  is  always  a  function  of  the  node  placement.  However,  this  only  introduces 
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randomness  into  the  behavior  of  A (n)  if  the  node  placement  is  itself  random  (as 
opposed  to  arbitrary  deterministic  node  placement).  Finally,  A (n)  never  depends  on 
the  realization  of  the  noise  process,  as  this  process  is  always  assumed  to  be  ergodic. 
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Chapter  3 


Location  Heterogeneity 


In  this  chapter,  we  analyze  the  impact  of  location  heterogeneity  on  the  performance 
of  a  wireless  network.  To  this  end,  we  consider  wireless  networks  with  arbitrary  (i.e., 
deterministic)  node  placement  (with  minimum-separation  constraint).  As  a  measure 
of  performance,  we  use  the  throughput  capacity  p*(n)  under  permutation  traffic  (i.e. 
each  node  is  source  and  destination  for  exactly  one  pair,  and  there  are  n  such  source- 
destination  pairs  with  uniform  traffic  demand).  Before  we  proceed,  recall  that  under 
random  node  placement  the  throughput  capacity  scales  like  p*(n)  =  n1-mm3.«/2±°(1) ) 
and  that  for  small  path-loss  exponents  a  G  (2,  3]  cooperative  communication  is  order 
optimal  and  for  large  path-loss  exponents  a  >  3  multi-hop  communication  is  order 
optimal. 

The  impact  of  this  arbitrary  node  placement  depends  crucially  on  the  path-loss 
exponent  a.  For  small  path- loss  exponents  a  G  (2,3],  we  show  that  for  random 
source-destination  pairing,  the  throughput  capacity  is  upper  bounded  as  p*(n)  = 
0(log6(n)n1_“/2).  We  then  present  a  novel  cooperative  communication  scheme  that 
achieves  for  any  node  placement  and  path-loss  exponent  a  >  2  a  per-node  rate  of 
ni-a/2— o(i).  Thus,  our  cooperative  communication  scheme  is  essentially  order  optimal 
for  any  such  arbitrary  network  with  a  G  (2,  3].  In  other  words,  in  the  small  path- loss 
regime,  the  scaling  of  p*(n)  is  the  same  irrespective  of  the  regularity  of  the  node 
placement. 

The  situation  is,  however,  quite  different  for  large  path-loss  exponents  a  >  3. 
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We  show  that  in  this  regime  the  scaling  of  p*{n )  depends  crucially  on  the  regularity 
of  the  node  placement,  and  multi-hop  communication  may  not  be  order  optimal  for 
any  value  of  a.  In  fact,  for  less  regular  networks  we  need  more  complicated  coopera¬ 
tive  communication  schemes  to  achieve  optimal  network  performance.  Towards  that 
end,  we  present  a  family  of  communication  schemes  that  smoothly  “interpolate”  be¬ 
tween  cooperative  communication  and  multi-hop  communication,  and  in  which  nodes 
communicate  at  scales  that  vary  smoothly  from  local  to  global.  The  amount  of  “in¬ 
terpolation”  between  the  cooperative  and  multi-hop  schemes  depends  on  the  level 
of  regularity  of  the  underlying  node  placement.  We  establish  the  optimality  of  this 
family  of  schemes  for  all  a  >  3  under  adversarial  node  placement  with  regularity 
constraint. 

The  remainder  of  this  chapter  is  organized  as  follows.  Section  3.1  provides  for¬ 
mal  statements  of  our  results.  Sections  3.2  and  3.3  describe  our  new  cooperative 
communication  scheme  (for  the  a  G  (2,3]  regime)  and  “interpolation”  scheme  (for 
the  a  >  3  regime)  for  arbitrary  wireless  networks.  Sections  3.4  through  3.9  contain 
proofs.  Finally,  Sections  3.10  and  3.11  contain  discussions  and  concluding  remarks. 


3.1  Main  Results 

This  section  presents  the  formal  statement  of  our  results.  In  Section  3.1.1,  we  consider 
low  path-loss  exponents,  i.e. ,  a  G  (2,3].  We  present  a  cooperative  communication 
scheme  for  arbitrary  node  placement  and  for  either  fast  or  slow  fading.  We  show  that 
this  communication  scheme  is  order  optimal  for  all  node  placements  when  a  G  (2,3]. 
In  Section  3.1.2,  we  consider  high  path-loss  exponents,  i.e.,  a  >  3.  We  present  a 
communication  scheme  that  “interpolates”  between  the  cooperative  and  the  multi¬ 
hop  communication  schemes,  depending  on  the  regularity  of  the  node  placement. 
We  show  that  this  communication  scheme  is  order  optimal  under  adversarial  node 
placement  with  regularity  constraint  when  a  >  3. 
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3.1.1  Low  Path-Loss  Regime  a  G  (2,3] 

The  first  result  proposes  a  novel  cooperative  communication  scheme,  called  hierarchi¬ 
cal  relaying  in  the  following,  and  bounds  the  per-node  rate  pHR(n)  that  it  achieves. 
This  provides  a  lower  bound  to  p*(n),  the  largest  achievable  per-node  rate.  The 
hierarchical  relaying  scheme  enables  cooperative  communication  on  the  scale  of  the 
network  size.  In  the  random  node  placement  case,  this  cooperation  could  be  enabled 
in  a  cluster  around  the  source  node  (cooperatively  transmitting)  and  in  a  cluster 
around  its  destination  node  (cooperatively  receiving).  With  arbitrary  node  place¬ 
ment,  such  an  approach  no  longer  works,  as  both  the  source  as  well  as  the  destination 
nodes  may  be  isolated.  The  hierarchical  relaying  scheme  circumvents  this  issue  by 
relaying  data  between  each  source-destination  pair  over  a  densely  populated  region 
in  the  network.  A  detailed  description  of  this  scheme  is  provided  in  Section  3.2,  the 
proof  of  Theorem  3.1  is  contained  in  Section  3.5. 

Theorem  3.1.  Under  fast  fading,  for  any  a  >  2,  rmin  e  (0,1),  and  S  e  (0,1/2), 
there  exists 

6i(n)  >  n-°(iog*-1/2(n)) 

such  that  for  any  n,  node  placement  V  (■ n )  with  minimum  separation  rmin,  and  per¬ 
mutation  traffic  matrix  Auc(n),  we  have 

p* (n)  >  pHR(n)  >  61(n)n1“a/2. 

The  same  conclusion  holds  for  slow  fading  with  probability  at  least 

1  -  exp  (  -  2a(logl/2+'(n)))  =  1  -  o(l) 


as  n  — >  oo. 

Theorem  3.1  shows  that  the  per-node  rate  pHR(n)  achievable  by  the  hierarchical 
relaying  scheme  is  at  least  n1-“/2-/3(n) ;  where  the  “loss”  term  j3{n)  converges  to  zero 
as  n  — >  oo  at  a  rate  arbitrarily  close  to  0[  log_1^2(n))  (by  choosing  S  small).  The 
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performance  of  the  hierarchical  relaying  scheme  can  intuitively  be  understood  as 
follows.  As  mentioned  before,  the  scheme  achieves  cooperation  on  a  global  scale. 
This  leads  to  a  multi-antenna  gain  of  order  n.  On  the  other  hand,  communication  is 
over  a  distance  of  order  n1/2,  leading  to  a  power  loss  of  order  n~a /2.  Combining  these 
two  factors  results  in  a  per-node  rate  of  n1-Q/2. 

We  note  that  Theorem  3.1  remains  valid  under  somewhat  weaker  conditions  than 
having  minimum  separation  rmin  G  (0,1).  Specifically,  we  show  that  the  result  of 
Ozgiir  et  al.  [38]  can  be  recovered  through  Theorem  3.1  as  the  random  node  placement 
satisfies  these  weaker  conditions.  We  discuss  this  in  more  detail  in  Section  3.10.4. 

The  next  theorem  establishes  optimality  of  the  hierarchical  relaying  scheme  in 
the  range  of  a  G  (2,  3]  for  arbitrary  node  placement.  The  proof  of  the  theorem  is 
presented  in  Section  3.6. 

Theorem  3.2.  Under  either  fast  or  slow  fading,  for  any  a  G  (2,3]  ,  rmin  G  (0,1), 
there  exists  b2(n)  =  0(log6(n))  such  that  for  any  n,  node  placement  V(n )  with  mini¬ 
mum  separation  rmin,  and  for  Auc(n)  chosen  uniformly  at  random  from  the  set  of  all 
permutation  traffic  matrices,  we  have 

p*(n)  <  b2(n)n1~a^2 
with  probability  1  —  o(l)  as  n  — »  oo. 

Note  that  Theorem  3.2  holds  only  with  probability  1  —  o(l)  for  different  reasons 
for  the  slow  and  fast  fading  case.  For  fast  fading,  this  is  due  to  the  randomness  in 
the  selection  of  the  permutation  traffic  matrix.  In  other  words,  for  fast  fading,  with 
high  probability  we  select  a  traffic  matrix  for  which  the  theorem  holds.  For  the  slow 
fading  case,  there  is  additional  randomness  due  to  the  fading  realization.  Here,  with 
high  probability  we  select  a  traffic  matrix  and  we  experience  a  fading  for  which  the 
theorem  holds. 

Comparing  Theorems  3.1  and  3.2,  we  see  that  for  a  G  (2,3]  the  proposed  hicrar- 
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chical  relaying  scheme  is  order  optimal,  in  the  sense  that 


log  (yHR(n))  =  log(p*(n)) 
n— >oo  log(n)  n^oo  log(n) 


1  -  a/2. 


Moreover,  the  rate  it  achieves  is  the  same  order  as  is  achievable  in  the  case  of  randomly 
placed  nodes.  Hence  in  the  low  path- loss  regime  a  €  (2,3],  the  heterogeneity  caused 
by  the  arbitrary  node  placement  has  no  effect  on  achievable  communication  rates. 


3.1.2  High  Path-Loss  Regime  a  >  3 

We  now  turn  to  the  high  path-loss  regime  a  >  3.  In  the  case  of  randomly  placed 
nodes,  multi-hop  communication  achieves  a  per- node  rate  of  pMH(n)  =  with 

probability  1  —  o(l)  and  is  order  optimal  for  a  >  3.  For  arbitrarily  placed  nodes, 
the  situation  is  quite  different  as  Theorem  3.3  shows.  The  proof  of  Theorem  3.3  is 
contained  in  Section  3.7. 

Theorem  3.3.  Under  either  fast  or  slow  fading,  for  any  a  >  3,  for  any  n,  there 
exists  a  node  placement  V(n)  with  minimum  separation  1/2  such  that  for  Auc(n) 
chosen  uniformly  at  random  from  the  set  of  all  permutation  traffic  matrices,  we  have 


p*  (n)  <  2 2+5«nW2, 
pMH(n)  <  4“n-a/2, 

as  n  —>  oo  with  probability  1  —  o(l). 

Comparing  Theorem  3.3  with  Theorem  3.1  shows  that  under  adversarial  node 
placement  with  minimum-separation  constraint  the  hierarchical  relaying  scheme  is 
order  optimal  even  when  a  >  3.  Moreover,  the  theorems  show  that  there  exist  node 
placements  satisfying  a  minimum  separation  constraint  for  which  hierarchical  relaying 
achieves  a  rate  of  at  least  a  factor  of  order  n  higher  than  multi-hop  communication 
for  any  a  >  3.  In  other  words,  for  those  node  placements  cooperative  communication 
is  necessary  for  order  optimality  also  for  any  a  >  3,  in  stark  contrast  to  the  situation 
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with  random  node  placement,  where  multi-hop  communication  is  order  optimal  for 
all  a  >  3. 

Theorem  3.3  suggests  that  it  is  the  level  of  regularity  of  the  node  placement  that 
decides  what  scheme  to  choose  for  path-loss  exponent  a  >  3.  So  far,  we  have  seen  two 
extreme  cases:  For  random  node  placement,  resulting  in  very  regular  node  placements 
with  high  probability,  only  local  cooperation  is  necessary  and  multi-hop  is  an  order- 
optimal  communication  scheme.  For  adversarial  arbitrary  node  placement,  resulting 
in  a  very  irregular  node  placement,  global  cooperation  is  necessary  and  hierarchical 
relaying  is  an  order-optimal  communication  scheme.  We  now  make  this  notion  of 
regularity  precise  and  introduce  a  cooperative  multi-hop  communication  scheme  that 
“interpolates”  between  multi-hop  communication  and  hierarchical  relaying  depending 
on  the  regularity  of  the  node  placement. 

Before  we  state  the  result,  we  need  to  introduce  some  notation.  Consider  again 
a  node  placement  V{n)  C  A{n)  with  minimum  separation  rmin  G  (0, 1).  Divide  A{n) 
into  squares  of  sidelength  d(n)  <  y/n,  and  fix  a  constant  p  e  (0, 1].  We  say  that  V ( n ) 
is  p-regular  at  resolution  d(n)  if  every  such  square  contains  at  least  pd2{n)  nodes. 
Note  that  every  node  placement  is  trivially  1-regular  at  resolution  y/n]  a  random 
node  placement  can  be  shown  to  be  /^-regular  at  resolution  log(n)  with  probability 
1  —  o(  1)  as  n  — >  oo  for  any  p  <  1;  and  nodes  that  are  placed  on  each  point  in  the 
integer  lattice  inside  A{n)  are  1-regular  at  resolution  1. 

The  cooperative  multi-hop  scheme  enables  cooperative  communication  on  the  scale 
of  regularity  d(n).  Neighboring  squares  of  sidelength  d(n)  cooperatively  communicate 
with  each  other.  To  transmit  between  a  source  and  its  destination,  we  use  multi-hop 
communication  over  those  squares.  In  other  words,  we  use  cooperative  communica¬ 
tion  at  small  scale  d(n),  and  multi-hop  communication  at  large  scale  y/n.  For  regular 
node  placements,  i.e.,  d(n)  =  1,  the  cooperative  multi-hop  scheme  becomes  the  clas¬ 
sical  multi-hop  scheme.  For  very  irregular  node  placement,  i.e.,  d(n)  =  n1/2,  the 
cooperative  multi-hop  scheme  becomes  the  hierarchical  relaying  scheme  discussed  in 
the  last  section. 

The  next  theorem  provides  a  lower  bound  on  the  per- node  rate  pCMH(n)  achievable 
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with  the  cooperative  multi-hop  scheme.  The  proof  of  the  theorem  can  be  found  in 
Section  3.8. 

Theorem  3.4.  Under  fast  fading,  for  any  a  >  2,  rmin  G  (0,1),  p  G  (0,1),  and 
5  G  (0,1/2)  there  exists 

b3(n )  >  n-o(iog5-1/2(n)) 

such  that  for  any  n,  node  placement  V  (■ n )  with  minimum  separation  rmin,  and  per¬ 
mutation  traffic  matrix  A uc(n),  we  have 

p*(n )  >  pCMH(n)  >  63(n)d*3~a(n)n-1/2, 


where 


d*(n)  =  min  {6  :  V(n)  is  p  regular  at  resolution  h}. 


The  same  conclusion  holds  for  slow  fading  with  probability  1  —  o(l)  as  n  — >  oo. 


Theorem  3.4  shows  that  if  V(n)  is  regular  at  resolution  d*(n)  then  a  per- node  rate 
of  at  least  pCMH(n)  >  cp,3~a(n)n-1/2-/3(«)  js  achievable,  where,  as  before,  the  “loss” 
term  j3{n)  converges  to  zero  as  n  oo  at  a  rate  arbitrarily  close  to  0(log"  ”1/2(n)). 
The  performance  of  the  cooperative  multi-hop  scheme  can  intuitively  be  understood 
as  follows.  The  scheme  achieves  cooperation  on  a  scale  of  d2(n).  This  leads  to  a  multi- 
antenna  gain  of  order  d2(n).  On  the  other  hand,  communication  is  over  a  distance  of 
order  d(n),  leading  to  a  power  loss  of  order  d~a(n).  Moreover,  each  source-destination 
pair  at  a  distance  of  order  n1/2  must  transmit  its  data  over  order  n1//2d”1(n)  many 
hops,  leading  to  a  multi-hop  loss  of  n-1/2d(n).  Combining  these  three  factors  results 
in  a  per-node  rate  of  d3-a(n)n-1/2. 

The  next  theorem  shows  that  Theorem  3.4  is  tight  under  adversarial  node  place¬ 
ment  under  a  constraint  on  the  regularity.  The  proof  of  the  theorem  is  presented  in 
Section  3.9. 


Theorem  3.5.  Under  either  fast  or  slow  fading,  for  any  a  >  3,  there  exists  64(71)  = 
0(log6(?7,)),  such  that  for  any  n,  and  d*{n),  there  exists  a  node  placement  V{n)  with 
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minimum  separation  1/2  and  1/ 2-regular  at  resolution  d*(n)  such  that  for  Auc(n) 
chosen  uniformly  at  random  from  the  set  of  all  permutation  traffic  matrices,  we  have 


p*{n )  <  64(n)cf 3  a{n)n  1^2, 


with  probability  1  —  o(l)  as  n  — >  oo. 


As  an  example,  assume  that 


d*(n )  =  nv 


for  some  rj  >  0.  Then  Theorem  3.4  shows  that  for  any  node  placement  of  regularity 
d*(n )  and  a  >  3, 

pCMH(n)  >  n(3-«)»»-i/2-^(n). 


where  / 3{n )  converges  to  zero  as  n  oo  at  a  rate  arbitrarily  close  to  0(log'  ”1/2(n)). 
In  other  words 


lim 

oo 


log(pCMH(n)) 

log(n) 


>  (3  -  a)rj  -  1/2. 


Moreover,  by  Theorem  3.5  there  exist  node  placements  with  same  regularity  such 
that  for  random  permutation  traffic  with  high  probability  p*(n)  is  (essentially)  of  the 
same  order,  in  the  sense  that 


lim 

n— KX) 


log(p*(w)) 

log(n) 


—  (3 


a)rj  -  1/2. 


In  particular,  for  rj  —  0  (i.e.,  regular  node  placement),  and  for  rj  =  log log(n) /  log(n) 
(i.e.,  random  node  placement),  we  obtain  the  order  n^1/2  scaling  as  expected.  For 
rj  —  1/2  (i.e.,  completely  irregular  node  placement),  we  obtain  the  order  n1-"/2  scaling 
as  in  Theorems  3.1  and  3.3. 


3.2  Hierarchical  Relaying  Scheme 

This  section  describes  the  architecture  of  our  hierarchical  relaying  scheme.  On  a  high 
level,  the  construction  of  this  scheme  is  as  follows.  Consider  n  nodes  V(n)  placed 
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Figure  3-1:  Sketch  of  one  level  of  the  hierarchical  relaying  scheme.  Here  {(wj,  iy»)}f=1 
are  three  source-destination  pairs.  Groups  of  source-destination  pairs  relay  their 
traffic  over  dense  subsquares,  which  contain  a  number  of  nodes  proportional  to  their 
area  (shaded).  We  time  share  between  the  different  dense  subsquares  used  as  relays. 
Within  each  of  these  relay  subsquares  the  scheme  is  used  recursively  to  enable  joint 
decoding  and  encoding  at  each  relay. 


arbitrarily  on  the  square  region  A(n)  with  a  minimum  separation  rmin.  Divide  A{n) 
into  subsquares  of  equal  size.  Call  a  subsquare  dense ,  if  it  contains  a  number  of 
nodes  proportional  to  its  area.  For  each  source-destination  pair,  choose  such  a  dense 
subsquare  as  a  relay ,  over  which  it  will  transmit  information  (see  Figure  3-1). 

Consider  now  one  such  relay  subsquare  and  the  nodes  that  are  transmitting  in¬ 
formation  over  it.  If  we  assume  for  the  moment  that  all  the  nodes  within  the  same 
relay  subsquare  could  cooperate  then  we  would  have  a  multiple  access  channel  (MAC) 
between  the  source  nodes  and  the  relay  subsquare,  where  each  of  the  source  nodes 
has  one  transmit  antenna,  and  the  relay  subsquare  (acting  as  one  node)  has  many 
receive  antennas.  Between  the  relay  subsquare  and  the  destination  nodes,  we  would 
have  a  broadcast  channel  (BC),  where  each  destination  node  has  one  receive  antenna, 
and  the  relay  subsquare  (acting  again  as  one  node)  has  many  transmit  antennas. 
The  cooperation  gain  from  using  this  kind  of  scheme  arises  from  the  use  of  multiple 
antennas  for  these  multiple  access  and  broadcast  channels. 

To  actually  enable  this  kind  of  cooperation  at  the  relay  subsquare,  local  com¬ 
munication  within  the  relay  subsquares  is  necessary.  It  can  be  shown  that  this  local 
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communication  problem  is  actually  the  same  as  the  original  problem,  but  at  a  smaller 
scale.  Hence  we  can  use  the  same  scheme  recursively  to  solve  this  subproblem.  We 
terminate  the  recursion  after  several  iterations,  at  which  point  we  use  simple  time 
sharing  to  bootstrap  the  scheme. 

The  construction  of  the  hierarchical  relaying  scheme  is  presented  in  detail  in  Sec¬ 
tion  3.2.1.  A  back-of-the-envelope  calculation  of  the  per-node  rate  it  achieves  is 
presented  in  Section  3.2.2.  A  detailed  analysis  of  the  hierarchical  relaying  scheme  is 
presented  in  Sections  3.4  and  3.5. 


3.2.1  Construction 


Recall  that 

A(b)  ±  [0,  Vbf 


is  the  square  region  of  area  b.  The  scheme  described  here  assumes  that  n  nodes  are 
placed  arbitrarily  in  A{n)  with  minimum  separation  rmin  G  (0,1).  We  want  to  find 
some  rate,  say  po,  that  can  be  supported  for  all  n  source-destination  pairs  of  a  given 
permutation  traffic  matrix  Auc(n).  The  scheme  that  is  described  below  is  “recursive” 
(and  hence  hierarchical)  in  the  following  sense.  In  order  to  achieve  rate  p0  f°r  n  nodes 
in  A(n),  it  will  use  as  a  building  block  a  scheme  for  supporting  rate  p±  for  a  network 
of 


n  i 


A 


n 

27  (n) 


nodes  over  A(af)  (square  of  area  cq)  with 


for  any  permutation  traffic  matrix  Auc(ni)  of  n \  nodes.  Here  the  branching  factor 
7 (n)  is  a  function  such  that  7 (n)  — >  00  as  n  — >  00.  We  will  optimize  over  the  choice 
of  7 (n)  later.  The  same  construction  is  used  for  the  scheme  over  A{af),  and  so  on.  In 
general,  our  scheme  operates  as  follows  at  level  t  >  0  of  the  hierarchy  (or  recursion). 
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In  order  to  achieve  rate  pg  for  any  permutation  traffic  matrix  Auc(t7)  over 

a  n 

1  2eY(n) 

nodes  in  A(a^),  with 


use  as  a  building  block  a  scheme  achieving  rate  /7+1  over  77+ 1  nodes  in  A(ai+\)  for 
any  permutation  traffic  matrix  Auc(t7+i).  The  recursion  is  terminated  at  some  level 
L(n )  to  be  chosen  later. 

We  now  describe  how  the  hierarchy  is  constructed  between  levels  l  and  i  +  1  for 
0  <  i  <  L(n).  Each  source-destination  pair  chooses  some  subsquare  as  a  relay  over 
which  it  transmits  its  message.  This  relaying  of  messages  takes  place  in  two  phases 
-  a  multiple  access  phase  and  a  broadcast  phase.  We  first  describe  the  selection  of 
relay  subsquares,  then  the  operation  of  the  network  during  the  multiple  access  and 
broadcast  phases,  and  finally  the  termination  of  the  hierarchical  construction. 


Setting  up  Relays 

Given  rq  nodes  in  A(ag),  divide  the  square  region  A{ag)  into  7 (n)  equal  sized  sub¬ 
squares.  Denote  them  by  {Ak(ae+ Call  a  subsquare  dense  if  it  contains  at 
least  77/27(71)  =  n^+i  nodes.  In  other  words,  a  dense  subsquare  contains  a  number  of 
nodes  of  at  least  a  l/2£+1  fraction  of  its  area.  We  show  that  since  the  nodes  in  A(ae) 
have  constant  minimum  separation  rmin,  a  subsquare  can  contain  at  most  0(07+1)  (i.e. 
0(07/7(71)))  nodes,  and  hence  that  there  are  at  least  Q(2~Y(n))  dense  subsquares. 
Each  source-destination  pair  chooses  a  dense  subsquare  such  that  both  the  source 
and  the  destination  are  at  a  distance  Q(Yae+i)  from  it.  We  call  this  dense  subsquare 
the  relay  of  this  source-destination  pair.  We  show  that  the  relays  can  be  chosen  such 
that  each  relay  subsquare  has  at  most  77+ 1  communication  pairs  that  use  it  as  relay, 
and  we  assume  this  worst  case  in  the  following  discussion. 
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Figure  3-2:  Description  of  the  multiple  access  phase  at  level  t  in  the  hierarchy  with 
m  =  ri(+ 1 .  The  hrst  system  block  represents  the  wireless  channel,  connecting  source 
nodes  with  relay  nodes  The  second  system  block  are  quantizers 

used  at  the  relay  nodes.  The  third  system  block  represents  using  ng+ i  times 
the  communication  scheme  at  level  t-\- 1  (organized  as  ng+\  permutation  traffic  matri¬ 
ces  {^kC(n£+\)}k=i  )  to  “transpose”  the  matrix  of  quantized  observations 
In  other  words,  before  the  third  system  block,  node  iq  has  access  to  and 

after  the  third  system  block,  node  V\  has  access  to  ■  The  fourth  system  block 

are  matched  hlters  used  at  the  relay  nodes. 


Multiple  Access  Phase 

Source  nodes  that  are  assigned  to  the  same  (dense)  relay  subsquare  send  their  mes¬ 
sages  simultaneously  to  that  relay.  We  time  share  between  the  @(2~£7 (n))  different 
relay  subsquares.  If  the  nodes  in  the  relay  subsquare  could  cooperate,  we  would  be 
dealing  with  a  MAC  with  at  most  rq+ 1  transmitters,  each  with  one  antenna,  and  one 
receiver  with  at  least  ng+\  antennas.  In  order  to  achieve  this  cooperation,  communi¬ 
cation  within  the  relay  subsquare  is  necessary.  To  this  end,  each  node  in  the  relay 
subsquare  quantizes  its  observations  and  then  exchanges  these  quantized  observations 
with  the  other  nodes  in  the  relay  subsquare.  This  exchange  is  performed  using  a  hier¬ 
archical  (i.e.,  recursive)  construction.  For  this  recursive  construction,  assume  that  we 
have  access  to  a  communication  scheme  to  transmit  data  according  to  a  permutation 
traffic  matrix  Auc(n£+i)  between  rq+ 1  nodes  located  in  a  square  of  area  ag+ 1.  We  now 
show  how  this  scheme  at  scale  ag+\  can  be  used  to  construct  a  scheme  for  scale  ag 
(see  Figure  3-2). 

Suppose  there  are  ng+ 1  source  nodes  u\ , .  . urH+l  (located  anywhere  in  A(ag)) 
that  relay  their  message  over  the  ng+ 1  relay  nodes  iq, . . .  ,vne  (located  in  the  same 
dense  subsquare  of  area  ag+ 1).  Each  source  node  n*  divides  its  message  bits  into  ng+ 1 
parts  of  equal  length.  Denote  by  xgj  the  encoded  part  j  of  the  message  bits  of  node 
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Ui  ( Xij  is  really  a  large  sequence  of  channel  symbols;  to  simplify  the  exposition,  we 
shall,  however,  assume  it  is  only  a  single  symbol).  The  message  parts  corresponding 
to  {xij  jyT)1  will  be  relayed  over  node  vv  as  will  become  clear  in  the  following.  Sources 
transmit  at  time  j  for  j  G  {1, . . .  ri£+i}. 

Let  ykj  be  the  observed  channel  output  at  relay  Vk  at  time  j.  Note  that  ykj  depends 
only  on  channel  inputs  In  order  to  decode  the  message  parts  corresponding 

to  at  relay  node  Vj,  it  needs  to  obtain  the  observations  {l/ij}^1  from  all 

other  relay  nodes.  In  other  words,  all  relays  need  to  exchange  information.  For  this, 
each  relay  Vk  quantizes  its  observation  {ykj}'j=\  at  an  appropriate  rate  K  independent 
of  n  to  obtain  {ykj}nj=i  ■  Quantized  observation  ykj  is  to  be  sent  from  relay  vk  to  relay 
Vj.  Thus,  each  of  the  n^+i  relay  nodes  now  has  a  message  of  size  K  for  every  other 
relay  node. 

This  communication  demand  within  the  relay  subsquare  can  be  organized  as  ne+ 1 
permutation  traffic  matrices  between  the  ri(+\  relay  nodes.  Note  that 

these  relay  nodes  are  located  in  the  same  square  of  area  ci(+\ .  In  other  words,  we  are 
now  faced  with  the  original  problem,  but  at  smaller  scale  a£+\.  Therefore,  using  ne+ 1 
times  the  assumed  scheme  for  transmitting  according  to  a  permutation  traffic  matrix 
for  rq+i  nodes  in  A(ag+ 1),  relay  Vj  can  obtain  all  quantized  observations  {{/ij}™^1-  Now 
Vj  uses  ri£+i  matched  Liters  on  to  obtain  estimates  {xij}™^  of  {xy}^1.  In 

other  words,  each  node  Vj  computes1 

.  . 

h  VTjK^WVt> 

for  every  i  e  {1, . . . ,  n^+i}.  Using  these  estimates  it  then  decodes  the  messages 
corresponding  to  . 


1Note  that,  since  we  assume  full  CSI,  node  Vj  has  access  to  the  channel  gains  {hUitVk\j]}i.k  at 
any  time  t  >  j.  In  particular,  this  is  the  case  at  the  time  the  matched  filtering  is  performed. 
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Figure  3-3:  Description  of  the  broadcast  phase  at  level  i  in  the  hierarchy  with  m  = 
ri(+\.  The  first  system  block  represents  transmit  beamforming  at  each  of  the  relay 
nodes  {/ty} .  The  second  system  block  are  quantizers  used  at  the  relay 

nodes.  The  third  system  block  represents  using  ng+ 1  times  the  communication  scheme 
at  level  £  +  1  (organized  as  ng+ 1  permutation  traffic  matrices  {Aj^n^+i)}^1)  to 
“transpose”  the  matrix  of  quantized  beamformed  channel  symbols  I11  other 

words,  before  the  third  system  block,  node  iq  has  access  to  {xt\  j^1,  and  after  the 
third  system  block,  node  iq  has  access  to  {x\ jYj=i-  The  fourth  system  block  is  the 
wireless  channel,  connecting  relay  nodes  with  destination  nodes  {  . 


Broadcast  Phase 

Nodes  in  the  same  relay  subsquare  then  send  their  decoded  messages  simultaneously 
to  the  destination  nodes  corresponding  to  this  relay.  We  time  share  between  the 
different  relay  subsquares.  If  the  nodes  in  the  relay  subsquare  could  cooperate,  we 
would  be  dealing  with  a  BC  with  one  transmitter  with  at  least  ng+ 1  antennas  and  with 
at  most  ng+i  receivers,  each  with  one  antenna.  In  order  to  achieve  this  cooperation,  a 
similar  hierarchical  construction  as  for  the  MAC  phase  is  used.  As  in  the  MAC  phase, 
assume  that  we  have  access  to  a  scheme  to  transmit  data  according  to  a  permutation 
traffic  matrix  Auc(rq+i)  between  rig+i  nodes  located  in  a  square  of  area  ag+\.  We 
again  use  this  scheme  at  scale  cq+i  in  the  construction  of  the  scheme  for  scale  ag  (see 
Figure  3-3). 

Suppose  there  are  ng+i  relay  nodes  iq, . . ,  ,vne  (located  in  the  same  dense  sub¬ 
square  of  area  ag+ 1)  that  relay  traffic  for  ng+ 1  destination  nodes  uq, . . . ,  wne+1  (located 
anywhere  in  A(ai)).  Recall  that  at  the  end  of  the  MAC  phase,  each  relay  node  v3  has 
(assuming  decoding  was  successful)  access  to  parts  j  of  the  message  bits  of  all  source 
nodes  .  Node  v3  re-encodes  these  parts  independently;  call  {xu-}”Tj1  the  en¬ 

coded  channel  symbols  (as  before,  we  assume  Xg3  is  only  a  single  symbol  to  simplify 
exposition).  Relay  node  v3  then  performs  transmit  beamforming  on  for  the 
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rif+\  transmit  antennas  of  {vk}k=i  to  be  sent  at  time  T  +  j  (for  some  appropriately 
chosen  T  >  0  not  depending  on  j).  Call  Xkj  the  resulting  channel  symbol  to  be  sent 
from  relay  node  v^.  Then2 

_  ^vk,wX^  d"  j\ 

II’  y7 EEf+fI,r 

In  order  to  actually  send  this  channel  symbol,  relay  node  Vk  needs  to  obtain  from 
node  Vj.  Thus,  again  all  relay  nodes  need  to  exchange  information. 

To  enable  local  cooperation  within  the  relay  subsquare,  each  relay  node  v3  quan¬ 
tizes  its  beamformed  channel  symbols  {xkjYk=i  an  appropriate  rate  iClog(n)  with 
K  independent  of  n  to  obtain  {xkjYk=\  ■  Now,  quantized  value  x^  is  sent  from  relay 
Vj  to  relay  Vk-  Thus,  each  of  the  ng+ 1  relay  nodes  now  has  a  message  of  size  K  log(n) 
for  every  other  relay  node. 

This  communication  demand  within  the  relay  subsquare  can  be  organized  as  ng+ 1 
permutation  traffic  matrices  between  the  rq+ 1  relay  nodes.  Note 

that  these  relay  nodes  are  located  in  the  same  square  of  area  ag+ Hence,  we  are 
again  faced  with  the  original  problem,  but  at  smaller  scale  ag+ Using  ng+ 1  times 
the  assumed  scheme  for  transmitting  according  to  a  permutation  traffic  matrix  for 
ng. |_i  nodes  in  A(ag+ 1),  relay  Vk  can  obtain  all  quantized  beamformed  channel  symbols 
Now  each  Vk  sends  Xkj  over  the  wireless  channel  at  time  instance  T  +  j 
(with  T  chosen  to  account  for  the  preceding  MAC  phase  and  the  local  cooperation  in 
the  BC  phase).  Call  yij  the  received  channel  output  at  destination  node  Wi  at  time 
instance  T  +  j.  Using  yl3 ,  destination  node  Wi  can  now  decode  part  j  of  the  message 
bits  of  its  source  node  Wj. 

Spatial  Re-Use  and  Termination  of  Recursion 

The  scheme  performs  appropriately  weighted  time  division  multiplexing  among  dif¬ 
ferent  levels  0  <  t  <  L(n).  Within  any  level  t  >  1,  multiple  regions  of  the  original 

2Note  that,  since  we  only  assume  causal  CSI,  relay  node  Vj  does  not  actually  have  access  to 
{hVk,wi[T  +  j]}k,i  at  the  time  the  beamforming  is  performed.  This  problem  can,  however,  be  cir¬ 
cumvented.  The  details  are  provided  in  the  proofs  (see  Lemma  3.10). 
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square  A(n)  of  area  n  are  being  operated  in  parallel.  The  details  related  to  the  effects 
of  interference  between  different  regions  operating  at  the  same  level  of  hierarchy  are 
discussed  in  the  proofs. 

The  recursive  construction  terminates  at  some  large  enough  level  L  =  L{n )  (to  be 
chosen  later).  At  this  scale,  we  have  ul  nodes  in  area  A  permutation  traffic 

matrix  at  this  level  comprises  til  source-destination  pairs.  These  transmissions  are 
performed  using  simple  time  sharing.  Again,  multiple  regions  in  the  original  square 
of  area  n  at  level  L  are  active  simultaneously. 

3.2.2  Achievable  Rates 

Here  we  present  a  back-of-the-envelope  calculation  of  the  per-node  rate  pHR(n)  achiev¬ 
able  with  the  hierarchical  relaying  scheme  described  in  the  previous  section.  The 
complete  proof  is  stated  in  Section  3.5.  We  assume  throughout  that  long  block  codes 
and  corresponding  optimal  decoders  are  used  for  transmission. 

Instead  of  computing  the  rate  achieved  by  hierarchical  relaying,  it  will  be  conve¬ 
nient  to  instead  analyze  its  inverse,  i.e.,  the  time  utilized  for  transmission  of  a  single 
message  bit  from  each  source  to  its  destination  under  a  permutation  traffic  matrix 
Auc(n).  Using  the  hierarchical  relaying  scheme,  each  message  travels  through  L  lev¬ 
els  of  the  hierarchy.  Call  Te(n)  the  amount  of  time  spent  for  the  transmission  of  one 
message  bit  between  each  of  the  source-destination  pairs  at  level  t  in  the  hierarchy. 
We  compute  T({n)  recursively. 

At  any  level  l  >  1,  there  are  multiple  regions  of  area  at  operating  at  the  same 
time.  Due  to  the  spatial  re-use,  each  of  these  regions  gets  to  transmit  a  constant 
fraction  of  time.  It  can  be  shown  that  the  addition  of  interference  due  to  this  spatial 
re-use  leads  only  to  a  constant  loss  in  achievable  rate.  Hence  the  time  required  to  send 
one  message  bit  is  only  a  constant  factor  higher  than  the  one  needed  if  region  A(at) 
is  considered  separately.  Consider  now  one  such  region  A(at).  By  the  time-sharing 
construction,  only  one  of  its  ©(2~£y (n))  dense  relay  subsquares  of  area  a^+i  is  active 
at  any  given  moment.  Hence  the  time  required  to  operate  all  relay  subsquares  is  a 
0(2_£7 (n))  factor  higher  than  for  just  one  relay  subsquare  separately.  Consider  now 
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one  such  relay  subsquare,  and  assume  ng+i  source  nodes  in  A(ag)  communicate  each 
ng. |_i  message  bits  to  their  respective  destination  nodes  through  a  MAC  phase  and 
BC  phase  with  the  help  of  the  rig+\  relay  nodes  in  this  relay  subsquare  of  area  ag+ , . 

In  the  MAC  phase,  each  of  the  rq+1  sources  simultaneously  sends  one  bit  to  each 
of  the  rif+i  relay  nodes.  The  total  time  for  this  transmission  is  composed  of  two  terms. 

i)  Transmission  of  rig+\  message  bits  from  each  of  the  rip+\  source  nodes  to  equally 
many  relay  nodes.  Since  we  time  share  between  @(2~f7 (n))  relay  subsquares, 
we  can  transmit  with  an  average  power  constraint  of  0(2~£j(n))  during  the 
time  a  relay  subsquare  is  active,  and  still  satisfies  the  overall  average  power 
constraint  of  1.  With  this  “bursty”  transmission  strategy,  we  require  a  total  of 

°(n!+1^£^)  =  °(”“  4V<-^(nK'2")  (3.1) 

channel  uses  to  transmit  ng+ 1  bits  per  source  node.  The  terms  on  the  left- 
hand  side  of  (3.1)  can  be  understood  as  follows:  ng+1  is  the  number  of  bits  to 
be  transmitted;  a“^2  is  the  power  loss  since  most  nodes  communicate  over  a 
distance  of  ©(a^2);  2~£j(n)  is  the  average  transmit  power;  np+  \  is  the  multiple- 
antenna  gain,  since  we  have  that  many  transmit  and  receive  antennas. 

ii)  We  show  that  constant  rate  quantization  of  the  received  observations  at  the 
relays  is  sufficient.  Hence  the  rip+  \  bits  for  all  sources  generate  0(ng+ 1)  trans¬ 
missions  at  level  t  +  1  of  the  hierarchy.  Therefore, 

0(ne+1Ti+1(n))  (3.2) 

channel  uses  are  needed  to  communicate  all  quantized  observations  to  their 
respective  relay  nodes. 

Combining  (3.1)  and  (3.2),  accounting  for  the  factor  2^fy (n)  loss  due  to  time  division 
between  relay  subsquares,  we  obtain  that  the  transmission  time  for  one  message  bit 
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from  each  source  to  the  relay  subsquare  in  the  MAC  phase  at  level  t  is 

7>MAC(n)  =  ojVy+^-^nK/2-1  +  rm(n)).  (3.3) 

Next,  we  compute  the  number  of  channel  uses  per  message  bit  received  by  the 
destination  nodes  in  the  BC  phase.  Similar  to  the  MAC  phase,  each  of  the  rq+i 
relay  nodes  has  ne+ 1  message  bits  out  of  which  one  bit  is  to  be  transmitted  to  each 
of  the  ri(+i  destination  nodes.  Since  there  are  rq+]  relay  nodes,  each  destination 
node  receives  rq+ 1  message  bits.  As  before  the  required  transmission  time  has  two 
components. 

i)  Transmission  of  the  encoded  and  quantized  message  bits  from  each  of  the  rq+ 1 
relay  nodes  to  all  other  relay  nodes  at  level  £  +  1  of  the  hierarchy.  We  show 
that  each  message  bit  results  in  ()({£  +  1 )  log  n)  quantized  bits.  Therefore, 
O  (ri£+i  {£  +  1 )  log  rij  bits  need  to  be  transmitted  from  each  relay  node.  This 
requires 

0(ni+1(£  +  1)  log(n)r<+i(n))  (3.4) 

channel  uses. 

ii)  Transmission  of  rq+i  message  bits  from  the  relay  nodes  to  each  destination 
node.  As  before,  we  use  bursty  transmission  with  an  average  power  constraint 
of  @(2”£7 (n))  during  the  fraction  0(2£/y~1(n))  of  time  each  relay  subsquare  is 
active  (this  satishes  the  overall  average  power  constraint  of  1).  Using  this  bursty 
strategy  requires 

<"'-F4uh)=0("'+l4V<1"“/2,(’‘)’‘“/2'1)  (3-5) 

channel  uses  for  transmission  of  ri£+ 1  bits  per  destination  node.  As  in  the  MAC 
phase,  ri(+i  in  the  left  hand  side  of  (3.5)  can  be  understood  as  the  number 
of  bits  to  be  transmitted,  a“^2  as  the  power  loss  for  communicating  over  dis¬ 
tance  @(a^2),  2“£7 (n)  as  the  average  transmit  power,  and  rif+\  as  the  multiple- 


antenna  gain. 


Combining  (3.4)  and  (3.5),  accounting  for  a  factor  2_^7 (n)  loss  due  to  time  division 
between  relay  subsquares,  the  transmission  time  for  one  message  bit  from  the  relays 
to  each  destination  node  in  the  BC  phase  at  level  i  is 

rf c (n)  =  +  (£  +  1)  log(n)rm(n)) .  (3.6) 

From  (3.3)  and  (3.6),  we  obtain  the  following  recursion 

n{n)  =  b?MAC(n)  +  Tcc(n) 

=  o(tf'y1+t<‘1-a/2)(n)na/2-1  +  (£  +  1)  log(n)rm(n)) 

=  0^2L7(n)n"/2_1  +  L  \og{n)n+i{n)  ) ,  (3.7) 

where  we  have  used  a  >  2.  This  recursion  holds  for  all  0  <  t  <  L.  At  level  L ,  we  time 
share  among  rijj  nodes  in  region  A{ai)  with  a  permutation  traffic  matrix  A 
Each  of  the  n ^  source-destination  pairs  uses  the  wireless  channel  for  1  /til  fraction  of 
the  time  at  power  O('Ul),  satisfying  the  average  power  constraint.  This  achieves  a 
rate  of  at  least  U(aL“72)  between  any  source-destination  pair.  Equivalently 

TL(n)  =  0(aaL/2) 

=  0(na/2'y~La/2(n)) 

=  0(nQ/27“L(n)).  (3.8) 

Using  the  recursion  (3.7)  L  times  and  combining  with  (3.8),  we  have 
T0(n)  =  0{na^2~12Lry(n)  +  L  log(n)ri(n)) 

=  O  (nal2~l  (L  log(n))i2I'7(n)  +  (Llog(n))LrL(n)j 
=  0^na72_1(Llog(n))I'(2L7(n)  +  n'y~L(n))  j .  (3.9) 
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The  term 


(Llog(n))L(2L7(n)  +  ny  L(n)) 


(3.10) 


is  the  “loss”  factor  over  the  desired  order  nQ/2-1  scaling,  and  we  now  choose  the 
branching  factor  7 (n)  and  the  hierarchy  depth  L  =  L(n )  to  make  it  small.  Fix  a 
S  G  (0, 1/2)  and  set3 


L{n)  =  log1/2  5(n), 
7  (n)  =  n1/(L(n)+1). 


With  this 

(L(n)  log(n))L(n)  <  'T? log-1/2- ^ (n) log i°g(n) ^ 

2L^7(n)  <  nIog_1/2_'5(n)+1°g'5_1/2(n)) 

717 -i(n)(n)  <  nlog'“1/2(n). 

Since  <5  >  0,  the  nlog,S  1/2(n)  term  dominates  in  (3.9),  and  we  obtain 

r0(n)  <  6(n)na/2_1, 


where 

6(n)  <  n0(log5'1/2(n)). 

Hence  the  per-node  rate  of  the  hierarchical  relaying  scheme  is  lower  bounded  as 


p"R{n)  =  l/r0(n)  >  ft^n1-4*72, 


with 

6(n)  >  n-°(log5"1/2(n)). 


3There  are  several  choices  of  L(n)  and  y(n)  that  result  in  the  “loss”  factor  (3.10)  to  be  of  order 
n °0).  The  choice  here  is  convenient  for  the  slow  fading  case  discussed  in  detail  in  Chapter  3.5.2. 
However,  other  choices  are  possible  as  well. 
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Note  that  to  minimize  the  loss  term,  we  should  choose  5  >  0  to  be  small. 


3.3  Cooperative  Multi-Hop  Scheme 

In  this  section,  we  provide  a  brief  description  of  the  cooperative  multi-hop  scheme. 
The  details  of  the  construction  and  the  analysis  of  its  performance  can  be  found  in 
Section  3.8. 

Recall  that  a  node  placement  V(n)  is  //-regular  at  resolution  d(n )  if  every  square 
[ id(n ),  ( i  +  1  )d(n)]  x  [jd(n),  ( j  +  1  )d(n)]  for  some  i,j  G  N  contains  at  least  //d2(n ) 
nodes.  Given  such  a  node  placement  V(n),  divide  it  into  squares  of  sidelength  d(n). 
Consider  four  adjacent  squares,  combined  into  a  bigger  square  of  sidelength  2 d(n). 
By  the  regularity  assumption  on  V (n),  this  bigger  square  contains  at  least  4 /id2{n) 
nodes.  Hence  we  can  apply  the  hierarchical  relaying  scheme  introduced  in  Section  3.2 
to  support  any  permutation  traffic  within  this  bigger  square  at  a  per-node  rate  of 

&(n)(d2(n))1-“/2  =  b(n)d2~a(n), 

where  b{n)  is  essentially  of  order  n-log  1/2(n).  By  properly  choosing  the  permutation 
traffic  matrices  within  every  possible  such  bigger  square  of  sidelength  2 d(n)  and  with 
appropriate  spatial  re-use,  nodes  in  each  square  of  sidelength  d(n)  can  communicate 
with  neighboring  squares  at  a  sum  rate  of 

d2(n)b(n)d2~a(n)  =  b{n)dA~a{n). 

We  now  construct  a  graph  with  n/d2(n)  vertices,  each  corresponding  to  one  square 
of  sidelength  dn  in  A{n).  Nodes  corresponding  to  neighboring  squares  are  connected 
by  an  edge  with  edge  capacity 

b(n)dA~a(n). 

The  resulting  graph  is  depicted  in  Figure  3-4.  Note  that  with  the  above  communica¬ 
tion  procedure  if  messages  can  be  routed  over  this  graph  then  the  same  messages  can 


61 


Figure  3-4:  Graph  (in  bold)  resulting  from  the  construction  of  the  cooperative  multi¬ 
hop  scheme.  The  entire  square  has  sidelength  y7 n ,  and  the  dashed  squares  have 
sidelength  d(n).  Each  (bold)  edge  in  the  graph  corresponds  to  using  the  hierarchical 
relaying  scheme  between  the  nodes  in  the  adjacent  squares  of  sidelength  d(n). 


be  reliably  communicated  over  the  wireless  network. 

Now,  to  send  a  message  from  a  source  node  in  V(n)  to  its  destination  node,  we 
first  locate  the  squares  of  sidelength  d(n)  they  are  located  in.  We  then  route  the 
message  over  the  edges  of  the  graph  constructed  above  in  a  multi-hop  fashion.  By 
the  construction  of  the  graph,  each  such  edge  is  implemented  using  the  hierarchical 
relaying  scheme.  In  other  words,  we  perform  multi-hop  communication  over  distance 
\fn  with  hop  length  d(n),  and  each  such  hop  is  implemented  using  hierarchical  relaying 
over  distance  d(n).  Since  each  edge  in  the  graph  has  a  capacity  of  b{n)dA~a{n )  and 
has  to  support  roughly  nl/2d(n)  source-destination  pairs,  we  obtain  a  per-node  rate 
of 


pCMH(n)  >  6(n)d4  a(n)n  l^2d  1(n) 
=  b{n)d3~a  {n)n~1^ 2 


per  source-destination  pair. 
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3.4  Analysis  of  the  Hierarchical  Relaying  Scheme 


In  this  section,  we  analyze  in  detail  the  hierarchical  relaying  scheme.  Throughout 
Sections  3.4.1  to  3.4.3,  we  consider  communication  at  level  £,  0  <  £  <  L  =  L(n),  of 
the  hierarchy.  All  constants  Kt  are  independent  of  t. 

Recall  that  at  level  t  in  the  hierarchy,  we  have  a  square  region  A(ae)  of  area 


containing  (at  least) 

a  n 

£  2^7  e(n) 

nodes  V{ne).  We  divide  A(ae)  into  7 (n)  subsquares  of  area  07+1.  Recall  that  a 
subsquare  of  area  07+1  in  level  £  of  the  hierarchy  is  called  dense  if  it  contains  at  least 
rif+\  nodes.  We  impose  a  power  constraint  of  Pf(n)  =  Q(2~e'y(n))  during  the  time  any 
particular  relay  subsquare  is  active.  Since  we  time  share  between  0(2_£7(n))  relay 
subsquares,  this  satisfies  the  overall  average  power  constraint  (by  choosing  constants 
appropriately). 

Since  other  regions  of  area  07  are  active  at  the  same  time  as  the  one  under  con¬ 
sideration,  we  have  to  deal  with  interference.  To  this  end,  we  consider  a  slightly 
more  general  noise  model  that  includes  the  experienced  interference  at  the  relay  sub¬ 
squares.  More  precisely,  we  assume  that,  for  all  u  e  V(ne),  the  additive  noise  term 
{zu[f]}t  is  independent  of  the  signal  {xu[t]}t  and  of  the  channel  gains  {hUjV[t\ }„jt;  that 
the  noise  term  is  stationary  and  ergodic  across  time  t,  but  with  arbitrary  dependence 
across  nodes  u\  and  that  the  noise  has  zero  mean  and  bounded  power  N0  independent 
of  n.  Note  that  we  do  not  require  the  additive  noise  term  to  be  Gaussian.  In  the 
above,  N0  accounts  for  both  noise  (which  has  power  1  in  the  original  model)  as  well 
as  interference.  We  show  in  Section  3.5  that  these  assumptions  are  valid. 
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Recall  the  following  choice  of  7 (n)  and  L(n): 


L(n )  =  log1//2  s(n), 

7(71)  =  n1/(L(n)+1), 

with  5  G  (0, 1/2)  independent  of  n.  This  choice  satisfies 

7  (n)  <  7(h)  if  n  <  n, 
ryLH  (n)  <  n  for  all  n, 

2~L(n)j (n)  — >  00  as  n  — >  00, 


(3.11) 


(3.12) 


The  first  condition  in  (3.12)  implies  that  the  number  of  subsquares  7(71)  we  divide 
A  (77)  into  increases  in  n.  The  second  condition  implies  that  the  subsquare  area  aL(n) 
at  the  last  level  of  the  hierarchy  is  bigger  than  1.  As  we  shall  see,  the  third  condition 
implies  that  the  number  of  dense  subsquares  at  the  last  level  of  the  hierarchy  (and 
hence  at  every  level)  grows  unbounded  as  n  — >  00  (see  Lemma  3.6  below). 

Throughout  Section  3.4,  we  consider  the  fast  fading  channel  model.  Slow  fading 
is  discussed  in  Section  3.5.2. 


3.4.1  Setting  up  Relays 

The  first  lemma  states  that  the  minimum-separation  requirement  rmin  G  (0, 1)  implies 
that  a  constant  fraction  of  subsquares  must  be  dense.  We  point  out  that  this  is  the 
only  consequence  of  the  minimum-separation  requirement  used  to  prove  Theorem  3.1. 
Thus  Theorem  3.1  remains  valid  if  we  just  assume  that  Lemma  3.6  below  holds 
directly.  See  also  Section  3.10.4  for  further  details. 

Lemma  3.6.  For  any  R(n^)  C  A(ag)  with  |l/(n^)|  >  rig  and  with  minimum  separation 
r min  £  (0, 1),  each  of  its  subsquares  of  area  ag+ 1  contains  at  most  Kiag/,y(n)  nodes, 
and  there  are  at  least  K22~e'y{n)  dense  subsquares,  for  some  constants  K1  and  K2. 

Proof.  Put  a  circle  of  radius  rmin/ 2  around  each  node.  By  the  minimum-separation 
requirement,  these  circles  do  not  intersect.  Each  node  covers  an  area  of  7rr^in/4. 
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Increasing  the  sidelength  of  each  subsquare  by  rmin,  this  provides  a  total  area  of 


(vV/7(™)  +  r min)  2  <  y~T  (1  +  r min)2 

7  (n) 

in  which  the  circles  around  these  nodes  are  packed.  Here  we  have  used  that  7 £+1(n)  < 
n  by  (3.12),  and  therefore 

7 (n)  <  n/'y{n)  =  a^. 

Hence  there  can  be  at  most  K\a(/~f{n)  nodes  per  subsquare  with 

TS  A  A  Hnin) 

Ai  —  4 - 2 - • 

nr  ■ 

mm 

Note  that,  since  rmin  <  1,  we  have  AT  >  1. 

Let  d(ne)  be  the  number  of  dense  subsquares  in  A(ae),  and  therefore  7 (n)  —  d(ng)  is 
the  number  of  subsquares  that  are  not  dense.  By  the  argument  in  the  last  paragraph, 
each  dense  subsquare  contains  at  most  A" 77/7 (n)  nodes,  and  those  subsquares  that 
are  not  dense  contain  less  than  rif+\  nodes  by  the  definition  of  dense  subsquares. 
Hence  d(ne)  must  satisfy 

d(ne)Kiag/'y(n)  +  (7(71)  -  d(ne))ne+1  >  \V(ne)\  >  nt. 


Thus,  using  a e  =  2 eri£,  ng+ 1  =  ne/2ry(n),  we  have 


d(ne)Ki2e  +  (7 (n)  -  d(ne))/2  >  7(77). 


As  AT  >  1,  this  yields 


d(ne)  > 


1  -  1/2 
K\2e  -  1/2 


7(n)  > 


1^7(71)  =  ^2  f7(n), 


with 


AT  = 


1 

2AY 


□ 
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Consider  V(ne)  C  A(ae)  with  |V(n^)|  >  ng,  and  choose  arbitrary  K22~iry{n) 
dense  subsquares  of  area  a^+ i  (as  guaranteed  by  Lemma  3.6).  Call  those  subsquares 
{Ak(ae+ 7(,l\  For  each  source-destination  pair,  we  now  select  one  such  dense 
subsquare  to  relay  traffic  over.  To  avoid  bottlenecks,  this  selection  has  to  be  done 
such  that  all  relay  subsquares  carry  approximately  the  same  amount  of  traffic.  More¬ 
over,  for  technical  reasons,  the  distances  from  the  source  and  the  destination  to  the 
relay  subsquare  cannot  be  too  small. 

Formally,  the  selection  of  relay  subsquares  can  be  described  by  the  schedules  S  G 
{0,  i}n^xA22  em(n)  with  su,k  =  1  if  source  node  u  relays  traffic  over  dense  subsquare 
k,  and  S  G  {0,  l}Aa2  try(n)xnt  with  sk,w  =  1  if  destination  node  w  receives  traffic 
from  dense  subsquare  k.  With  slight  abuse  of  notation,  let  rUjAk(ae+1)  be  the  distance 
between  node  u  G  V{ng)  and  the  closest  point  in  Ak(ae+i),  i.e., 

ru,Ak(ae+1)  —  min  rUjV.  (3.13) 

+  ’  veAk(ae+1) 

Define  the  sets 

S(m)  ^{se  {0. i jn.xM-sw  :0  <  _ Suk  <  „(+1  Vfc, 

0  <  E  £f7("W  <  1  V«, 

Su,t  =  1  implies  r„A(„(+1)  >  Vi i,k\  (3.14) 

and 

S(n?)  ±{Se  {0,  1}A'22-C(n)xn,  .  gT  £  j. 

The  sets  S(ri£)  and  S(nt)  are  the  collection  of  schedules  satisfying  the  conditions 
mentioned  in  the  last  paragraph.  More  precisely,  the  first  condition  in  (3.14)  ensures 
that  at  most  rq+i  source-destination  pairs  relay  over  the  same  dense  subsquare,  the 
second  condition  ensures  that  each  source-destination  pair  chooses  at  most  one  relay 
subsquare,  and  the  third  condition  ensures  that  sources  and  destinations  are  at  least 
at  distance  y/2ae+i  from  the  chosen  relay  subsquare. 

Next,  we  prove  that  any  node  placement  that  satisfies  Lemma  3.6  allows  for  a  de- 
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composition  of  any  permutation  traffic  matrix  Auc(rq)  into  a  small  number  of  sched¬ 
ules  belonging  to  S(ng)  and  S(ng). 

Lemma  3.7.  There  exists  K3  such  that  for  all  n  large  enough  (independent  of  i), 
and  every  permutation  traffic  matrix  Auc(rq)  G  {0,  lfnexne  we  can  find  K32e  schedules 
C  S(ne),  {S^\ng)}fjf  C  S(ne)  satisfying 


K32e 


AUCM  =  ^  S®(ne)S®{ne), 
1=1 


for  some  constant  K3. 


Proof.  Pick  an  arbitrary  source-destination  pair  in  Auc(rq),  and  consider  the  sub¬ 
squares  containing  the  source  and  the  destination  node.  Since  each  subsquare  has 
side  length  y/ag+ 1,  there  are  at  most  50  subsquares  at  distance  less  than  y/2ag+i  from 
either  of  those  two  subsquares.  As  (n)  — >  oo  as  n  — >  oo  by  (3.12),  there  exists 

K  (independent  of  £)  such  that  for  n  >  K  we  have  50  <  K22~£~1'y(n).  Since  there 
are  at  least  K22~e'y(n )  dense  subsquares  by  Lemma  3.6,  there  must  exist  at  least 
K22~e~1'y(n)  dense  subsquares  that  are  at  distance  at  least  \J2ai+\  from  both  the 
subsquares  containing  the  source  and  the  destination  node. 

In  order  to  construct  a  decomposition  of  Auc(n^),  we  use  the  following  procedure. 
Sequentially,  each  of  the  rq  source-destination  pairs  chooses  one  of  the  (at  least) 
A'22_f_17(n)  dense  subsquares  at  distance  at  least  \J2ai+\  that  has  not  already  been 
chosen  by  rq+i  other  pairs.  If  any  source-destination  pair  cannot  select  such  a  sub¬ 
square,  then  stop  the  procedure  and  use  the  source-destination  pairs  matched  with 
dense  subsquares  so  far  to  define  matrices  S^^ng)  and  S^(ng).  Now,  remove  all  the 
matched  source-destination  pairs,  forget  that  dense  subsquares  were  matched  to  any 
source-destination  pair  and  repeat  the  above  procedure,  going  through  the  remaining 
source-destination  pairs. 

Let 

K3  =  4 /K2. 


We  claim  that  by  repeating  this  process  of  generating  matrices  S^l\ng)  and  S^(ng), 
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we  can  match  all  source-destination  pairs  to  some  dense  subsquare  with  at  most 
K32e  such  matrices.  Indeed,  a  new  pair  of  matrices  is  generated  only  when  a  source- 
destination  pair  cannot  be  matched  to  any  of  its  available  (at  least)  K22~e~1'y(n) 
dense  subsquares.  If  this  happens,  all  these  dense  subsquares  are  matched  by  rig+i  = 
ng/2ry(n)  pairs.  Hence  at  least  K22~e~2ng  source-destination  pairs  are  matched  in 
each  “round” .  Since  there  are  ng  total  pairs,  we  need  at  most 


rq 

K22-t~*ne 


K32l 


matrices  S^(ng)  and  S^(ng). 


□ 


For  a  permutation  traffic  matrix  Auc(rq),  communication  proceeds  as  follows. 
Write 


*r32e 


\uc(ne)  =  J2S{i)(ng)SW(ng) 

i= 1 

as  in  Lemma  3.7.  Split  time  into  K32l  equal  length  time  slots.  In  slot  i,  we  use 
S^\ng) S^\n g)  as  our  traffic  matrix.  Consider  without  loss  of  generality  7  =  1  in  the 
following.  Write 


Ay2  C(n) 

SW(ne)SW(ng)  =  S^k\ne+1)S^k\ne+1), 

k= 1 


where  S^1,k\ng+i)S^1,k\ng+i)  is  the  traffic  relayed  over  the  dense  subsquare  Ak(ag+ 1). 
We  time  share  between  the  schedules  for  k  e  {1, . . . ,  K22~l^(n)} .  Consider  now  any 
such  k.  In  the  worst  case,  there  are  exactly  rq+i  communication  pairs  to  be  relayed 
over  Ak(ag+i),  and  the  relay  subsquare  Ak(ag+ 1)  contains  exactly  rq+]  nodes.  We 
shall  assume  this  worst  case  in  the  following. 

We  focus  on  transmission  according  to  the  traffic  matrix  S'(1,1)(7q+i)S'(1,1)(7q+i). 
Let  V(ng+ 1)  be  the  nodes  in  Ai(ag+\),  and  let  U{ng+\)  and  W(ng+i)  be  the  source  and 
destination  nodes  of  S'd’b(7i£+1)5'(1>1)(ri£+1),  respectively.  In  other  words,  the  source 
nodes  U(ng+ 1)  communicate  to  their  respective  destination  nodes  W(ng+i)  using  the 
nodes  V(ng+i)  as  relays. 


Figure  3-5:  Sketch  of  the  quantized  channel.  /  and  ip  are  the  channel  encoder  and  de¬ 
coder,  respectively;  {(lk}k=i  are  quantizers;  Py\x  and  Px\y  represent  stationary  ergodic 
channels  with  the  indicated  marginal  distributions. 

3.4.2  Multiple  Access  Phase 

Each  source  node  in  U(ri£+i)  splits  its  message  into  rq+1  equal  length  parts.  Part  j  at 
every  node  u  G  U(ri£+ 1)  is  to  be  relayed  over  the  j'-tli  node  in  V(ri£+ 1).  Each  part  is 
separately  encoded  at  the  source  and  separately  decoded  at  the  destination.  After  the 
source  nodes  are  done  transmitting  their  messages,  the  nodes  in  the  relay  subsquare 
quantize  their  (sampled)  observations  corresponding  to  part  j  and  communicate  the 
quantized  values  to  the  j-th  node  in  the  relay  subsquare.  This  node  then  decodes  the 
j'-tli  message  parts  of  all  source  nodes.  Note  that  this  induces  a  uniform  traffic  pattern 
between  the  nodes  in  the  relay  subsquare,  i.e.,  every  node  needs  to  transmit  quantized 
observations  to  every  other  node.  While  this  traffic  pattern  does  not  correspond  to 
a  permutation  traffic  matrix,  it  can  be  written  as  a  sum  of  ri£+ 1  permutation  traffic 
matrices.  A  l/ri£+\  fraction  of  the  traffic  within  the  relay  subsquare  is  transmitted 
according  to  each  of  these  permutation  traffic  matrices.  This  setup  is  depicted  in 
Figure  3-2  in  Section  3.2.1. 

Assuming  for  the  moment  that  we  have  a  scheme  to  send  the  quantized  obser¬ 
vations  to  the  dedicated  node  in  the  relay  subsquare,  the  traffic  matrix 
between  U(ri£+ 1)  and  V(ri£+ 1)  describes  then  a  MAC  with  ri£+ 1  transmitters,  each 
with  one  antenna,  and  one  receiver  with  rip+\  antennas.  We  call  this  the  MAC  in¬ 
duced  by  in  the  following.  Before  we  analyze  the  rate  achievable  over  this 

induced  MAC,  we  need  an  auxiliary  result  on  quantized  channels. 

Consider  the  quantized  channel  in  Figure  3-5.  Here,  /  is  the  channel  encoder,  ip 
the  channel  decoder,  {gfc}™=1  quantizers.  All  these  have  to  be  chosen.  Py\x  and  Px\y, 
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on  the  other  hand,  represent  fixed  stationary  ergodic  channels  with  the  indicated 
marginal  distributions.  We  call  R  the  rate  of  the  channel  code  (/,  ip)  and  {/4}™=1 
the  rates  of  quantizers  {qk}™=  i- 

Lemma  3.8.  If  there  exist  distributions  Px  and  {Pyk\yk}™=i  such  that  R  <  I(x\x) 
and  Rk  >  I(yk',yk),  \/k,  then  (/?,,  {Rk}k Li)  is  achievable  over  the  quantized  channel. 

Proof.  The  proof  follows  from  a  simple  extension  of  Theorem  1  in  Appendix  II  of  [38] . 

□ 

Lemma  3.9.  Let  the  additive  noise  {zv}vev(ne+1)  be  uncorrelated  (over  v).  For  the 
MAC  induced  by  S^1,1^  (ri£+ 1)  with  per-node  average  power  constraint  P((n)  <  nfffpa^2 , 
a  rate  of 

pTACi:n )  >  KAPz(n)ni+1afa/2 

per  source  node  is  achievable,  and  the  number  of  bits  required  at  each  relay  node  to 
quantize  the  observations  is  at  most  K5  bits  per  rq+ 1  total  message  bits 4  sent  by  the 
source  nodes,  for  some  constants  Jl4  and  K5 . 

Proof.  The  source  nodes  send  signals  with  a  power  of  (essentially)  for  a 

fraction  P( (n)ne+\ae  a ^  <  1  of  time  and  are  silent  for  the  remaining  time.  To  ensure 
that  interference  is  uniform,  the  time  slots  during  which  the  nodes  send  signals  are 
chosen  randomly  as  follows.  Generate  independently  for  each  region  A(a ()  a  Bernoulli 
process  {i3 [t] with  parameter  P((n)ne+iae  Q^2/(l  +  rj)  <  1  for  some  small  q  >  0. 
The  nodes  in  A(a^)  are  active  whenever  B[t\  —  1  and  remain  silent  otherwise.  Since 
the  blocklength  of  the  codes  used  is  assumed  to  be  large,  this  satisfies  the  average 
power  constraint  of  Pe(n)  with  high  probability  for  any  q  >  0.  Since  we  are  interested 
only  in  the  scaling  of  capacity,  we  ignore  the  additional  1/(1 +  77)  term  in  the  following 
to  simplify  notation.  Clearly,  we  only  need  to  consider  the  fraction  of  time  during 
which  B[t\  =  1. 

Let  y  be  the  received  vector  at  the  relay  subsquare,  y  the  (componentwise)  quan- 
4Total  message  bits  refers  to  the  sum  of  all  message  bits  transmitted  by  the  m+ 1  source  nodes. 
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tized  observations.  We  use  a  matched  filter  at  the  relay  subsquare,  i.e. , 


x v,  iii  |i  Vi 

\\hu\\ 

where  column  vector  hu  =  {hu,v} V&v (nt+1)  are  the  channel  gains  between  node  u  G 
U(ne+ 1)  and  the  nodes  in  the  relay  subsquare  V(ne+i).  The  use  of  a  matched  hlter  is 
possible  since  we  assume  full  CSI  is  available  at  all  the  nodes. 

We  now  use  Lemma  3.8  to  show  that  we  can  design  quantizers  {qv}v&v{nl+1)  of  con¬ 
stant  rate  and  achieve  a  per-node  communication  rate  of  at  least  K^P^{n)n^+\aJa^2 . 
The  first  channel  in  Lemma  3.8  (see  Figure  3-5)  will  correspond  to  the  wireless  chan¬ 
nel  between  a  source  node  u  and  its  relay  subsquare  V(n^+ 1).  The  second  “channel” 
in  Lemma  3.8  will  correspond  to  the  matched  hlter  used  at  the  relay  subsquare.  To 
apply  Lemma  3.8,  we  need  to  find  a  distribution  for  xu  and  for  yv\yv.  Define 

L/  W,j4i(a£^_i)/ A  1 

with  r^A^ae +i)  as  'n  (3-13),  to  be  the  normalized  distance  of  the  source  node  u  G 
f/(n£+i)  to  the  relay  subsquare  Ai(eq+i).  For  each  u  G  f/(n£+i),  let 

xu  ~  Ac(0,  Kn^lx^2) 

independent  of  for  u  A  u,  and  let  yv  =  yv  +  zv  for  zv  ~  Ac(0,  A2)  independent  of 
y  and  for  some  A2  >  0.  Note  that  the  channel  input  xu  has  power  that  depends  on 
the  normalized  distance  fu  (i.e.,  only  nodes  u  G  U(ri£+i)  that  are  at  maximal  distance 
y/2 a,£  from  the  relay  subsquare  transmit  at  full  available  power).  This  is  to  ensure 
that  all  signals  are  received  at  roughly  the  same  strength  by  the  relays. 

We  proceed  by  computing  I(yv;  yv\{hu,v})  and  I(xu]  xu\{hu^})  as  required  in 
Lemma  3.8  (the  conditioning  on  being  due  to  the  availability  of  full  CSI). 

Note  first  that  by  construction  of  S'^1,1^(n^+i)  (see  (3.14)),  we  have  for  u  G  U(ri(+ 1) 
and  v  G  V  (rq+i) 

Li,Ai(a^+1)  —  ru,v  A  2 rUtA1(ae+1)i 
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and  hence 


1 


1 


< 


< 


2^/2n£  r  u,v 

From  this,  and  since  \hUfV\ 2  =  r~“,  we  obtain 


(3.15) 


2-3“/2a(~“/2  <  <  2-“/2a(““/2, 

2 -3"/2n(+1a(““/2  <  llh„ll2f.“  <  2~“/2n,i,a7“/2. 


(3.16) 


Mil  '  U  — 


W't+lQ'g 


We  start  by  computing  /(y„;  ^|{/iu,c})-  We  have 


t/ll  ^  ^  hU,VXU  Zv  -\-  Zy , 

u£U(ne+i) 


and  hence  yv  has  mean  zero  and  variance 


E(|y„|2)  =  ^  {K.vKn^af2  +  N0  +  A2 

ue(7(n^+i) 

<  +  iV0  +  A2 

=  2-“/2  +  A^0  +  A2, 


where  we  have  used  (3.16).  Hence 


Vv  | \hu,v  })  h(Slv  |  {^ti,t/})  h({)v  IVvi  {^u,v  } ) 


<  log  (27reE(|^|2))  -  log(27reA2) 

<  log  (2vre(2-“/2  +  N0  +  A2))  -  log(27reA2) 


=  log 


2-“/2  +  N0\ 
A2  / 


(3.17) 


We  now  compute  I{xu\  xu\{hu^}).  We  have 


Xu  1 1  hu  1 1  xu  -1~ 


ueu(ne+  i)\{u} 


ht  h  _  /it 

tt  ^  +  ^(z  +  z) 


l/i„ 


|h,;, 
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Conditioned  on  {ha}u£u(ne+1), 


\\hu\\xu  ~  A7C(0,  \\hu\\ 2f“n£+1a“/2), 

and 

/  h^h-  h t  2  n 

E(|Eiw(„w,\W  +  *)|  |{M) 

=  nMaT  J2  f*i irp- + ^ + At 

u£U  (ne+1)\{u}  11 

where  we  have  used  the  assumption  that  {zv}v£v(ne+1)  are  uncorrelated.  Using  (3.16), 
this  is,  in  turn,  upper  bounded  by 

2S“/V>yi0?  5]  fi\htha\2  +  N,,  +  A2. 

ueU  (ne+1)\{u} 

Similarly,  we  can  lower  bound  the  received  signal  power  as 

E(||M2W2)  >  2_3a/2. 

Since  Gaussian  noise  is  the  worst  additive  noise  under  a  power  constraint  [18],  and 
applying  Jensen’s  inequality  to  the  convex  function  log(l  +  1/x),  we  obtain 
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E  \h  ll,V  |2|  K,v 

veV(ne+ 1) 

E 

v€V(ne+1) 


-a  -a 

'  U,V  1  U,V  5 


(3.19) 


and  hence  using  (3.15) 


E  f 


y  ru\,,'u"'u\ 

u€U  (ne+1)\{u} 


■I  h{hu 


=  E 

uGU (ne+1)\{u}  v£V(ne+1) 

<  2 -“n|+1a7“. 


E~°L  -a~OL  -OL 

1  U  1  U,V  'LL  1  U,v 


Therefore  we  can  continue  (3.18)  as 


I(xu,xu\{hu,v})  >  log 


2~3q/2 


2«/2  +  No  +  A2 


A 


AE 


(3.20) 


Using  (3.17)  and  (3.20)  in  Lemma  3.8,  and  observing  that  we  only  communicate 
during  a  fraction 

Pf,(n)nt+1aJa/2  <  1 

of  time  yields  a  per  source  node  rate  p^AC(n)  arbitrarily  close  to 


K4Pe{n)n£+iae 


■a/2 


and  a  quantizer  of  rate  arbitrarily  close  to 


log 


2 -q/2  +  1Vo\ 

A2  ) 


bits  per  observation  at  each  relay  node.  Since  by  (3.20)  the  mutual  information 
I(xu)  xu\{hy„v})  is  at  least  KA  for  every  u  G  U(ng+i)  during  the  fraction  of  time  we 
actually  communicate,  this  implies  that  there  are  at  most  1/A'4  observations  at  each 
relay  node  per  rig+\  total  message  bits.  Thus  the  number  of  bits  per  relay  node 
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required  to  quantize  the  observations  is  at  most 


K5^—  log  ( 1  + 

A  4 


2~a/2  +  Nq 

A2 


bits  per  ri(+\  total  message  bits  sent  by  the  source  nodes. 


□ 


3.4.3  Broadcast  Phase 

At  the  end  of  the  MAC  phase,  each  node  in  the  relay  subsquare  received  a  part  of  the 
message  sent  by  each  source  node.  In  the  BC  phase,  each  node  in  the  relay  subsquare 
encodes  these  messages  together  for  rq+i  transmit  antennas.  The  encoded  message 
is  then  quantized  and  communicated  to  all  the  nodes  in  the  relay  subsquare.  These 
nodes  then  send  the  quantized  encoded  message  to  the  destination  nodes  W(nt+i). 
Note  that  this  again  induces  a  uniform  traffic  pattern  between  the  nodes  in  the 
relay  subsquare,  i.e.,  every  node  needs  to  transmit  quantized  encoded  messages  to 
every  other  node.  While  this  traffic  pattern  does  not  correspond  to  a  permutation 
traffic  matrix  it  can  be  written  as  a  sum  of  n^+ 1  permutation  traffic  matrices.  A 
l/n^+i  fraction  of  the  traffic  within  the  relay  subsquare  is  transmitted  according  to 
each  of  these  permutation  traffic  matrices.  This  setup  is  depicted  in  Figure  3-3  in 
Section  3.2.1. 

Assuming  for  the  moment  that  we  have  a  scheme  to  send  the  quantized  en¬ 
coded  messages  to  the  corresponding  nodes  in  the  relay  subsquare,  the  traffic  matrix 
S'(1,1)(n^+i)  between  V(ji£+i)  and  W(ji£+i)  describes  then  a  BC  with  one  transmitter 
with  ri£+i  antennas  and  n^+  \  receivers,  each  with  one  antenna.  We  call  this  the  BC 
induced  by  in  the  following. 

Lemma  3.10.  For  the  BC  induced  by  with  per-node  average  power  con¬ 

straint  P({n )  <  nj^a^2 ,  a  rate  of 


PeC(n)  >  K&Pt(n)ni+1ata/2 


is  achievable  per  destination  node,  and  the  number  of  bits  required  to  quantize  the 
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observations  is  at  most  K7(£  +  1)  log(n)  bits  at  each  relay  node  per  rip+\  total  message 
bits 5  received  by  the  destination  nodes,  for  some  constants  K§  and  K7. 

Proof.  Consider  a  node  v  E  V (rq+i)  in  the  relay  subsquare,  say  the  hrst  one.  From 
the  MAC  phase,  this  node  received  the  first  part  of  the  messages  of  each  source 
node  u  E  U(ng+i).  We  would  like  to  jointly  encode  these  message  parts  at  the  relay 
node  using  transmit  beamforming,  and  then  transmit  the  corresponding  encoded 
signal  using  all  the  nodes  in  the  relay  subsquare.  However,  this  cannot  be  done 
directly  because  at  the  encoding  time,  the  future  channel  state  at  transmission  time 
is  unknown. 

We  circumvent  this  problem  by  reordering  the  signals  to  be  transmitted  at  the 
relay  nodes  as  follows.  Let 


a  2 

\@v,w  J  vE  V(ne+1),weW(n(+1)  £  {0,  7t/2,  7T,  37r/2}''^+1. 

be  a  “quantized”  channel  state.  The  part  of  the  messages  at  node  v  in  the  relay 
subsquare  is  encoded  for  ng+ 1  transmit  nodes  with  an  assumed  channel  gain  of 

hv,w[t]  =  rfffj2  exp ( y/—19VjW [f ] ) , 

where  the  {9V}W[f\}VjWtt  are  cycled  as  a  function  of  t  through  all  possible  values  in 
{0,  7t/2,  7t,  37r/2}n«+1.  The  components  of  the  encoded  messages  are  then  quantized 
and  each  component  sent  to  the  corresponding  node  in  the  relay  subsquare.  Once  all 
nodes  in  the  relay  subsquare  have  received  the  encoded  message,  they  send  in  each 
time  slot  a  sample  of  the  encoded  messages  corresponding  to  the  quantized  channel 
state  closest  (in  Euclidean  distance)  to  the  actual  channel  realization  in  that  time 
slot.  By  ergodicity  of  {0U;V[t]}t,  each  quantized  channel  state  is  used  approximately 
the  same  number  of  times.  More  precisely,  as  the  message  length  grows  to  infinity, 
we  can  send  samples  of  the  encoded  message  parts  a  1/(1  +  rj)  fraction  of  time  with 
probability  approaching  1  for  any  r]  >  0.  Since  we  have  no  constraint  on  the  encoding 

5Total  message  bits  refers  to  the  sum  of  all  message  bits  received  by  the  ng+ 1  destination  nodes. 
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delay  in  our  setup,  we  can  choose  r]  arbitrarily  small,  and  given  that  we  are  only 
interested  in  scaling  laws,  we  will  ignore  this  term  in  the  following  to  simplify  notation. 
Note  that  the  destination  nodes  can  reorder  the  received  samples  since  we  assume  full 
CSI.  In  the  following,  we  let  {9v,w}VjW  be  the  random  quantized  channel  state  induced 
by  {Ov,w}v,w  through  the  above  procedure.  Denote  by  {hVj w}v,w  the  corresponding 
channel  gains. 


As  in  the  MAC  phase,  the  nodes  in  the  relay  subsquare  send  signals  at  a  power 
(essentially)  nj^a^2  a  fraction  Pijn)ne+iaea^2  <  1  of  time  and  are  silent  for  the 
remaining  time.  To  create  interference  at  uniform  power,  this  is  done  in  the  same 
randomized  manner  as  in  the  MAC  phase.  Generate  independently  for  each  region 
A{aj)  a  Bernoulli  process  { A? [t] with  parameter  Pt(n)nt+iae  a^2 /(I  +  rj)  for  some 
small  r)  >  0.  The  nodes  in  A(aj)  are  active  whenever  B[t]  =  1  and  remain  silent 
otherwise.  As  before,  we  ignore  the  additional  1/(1  +  rj)  term.  Again  we  only  need 
to  consider  the  fraction  of  time  during  which  B[t\  =  1. 


Consider  the  message  part  at  a  relay  node  for  destination  node  w  G  W(ne+j). 
We  encode  this  part  independently;  call  xw  the  encoded  message  part.  The  relay 
node  then  performs  transmit  beamforming  to  construct  the  encoded  message  for  all 
its  destination  nodes,  i.e., 


»=  £ 

weW(ne+ 1) 


Xw> 


where  row  vector  hw  =  {hv,w}v£v(ne+1)  contains  the  channel  gains  to  node  w,  and 
where  we  have  used  \hVjW\  =  \hVjW\.  The  relay  node  then  quantizes  the  vector  of 
encoded  messages  componentwise  and  forwards  the  quantized  version  x  to  the  other 
nodes  in  the  relay  subsquare.  These  nodes  then  send  x  over  the  channel  to  the 
destination  nodes.  The  received  signal  at  destination  node  w  is  thus 


Uw 


hwx  +  zu 


With  this,  we  have  the  setup  considered  in  Lemma  3.8  (with  different  variable 
names).  The  first  “channel”  in  Lemma  3.8  (see  Figure  3-5)  will  correspond  to  the 
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transmit  beamforming  used  at  the  relay  subsquare.  The  second  channel  in  Lemma  3.8 
will  now  correspond  to  the  wireless  channel  between  the  relay  subsquare  V(ne+ 1)  and 
a  destination  node  w.  To  apply  Lemma  3.8,  we  need  to  find  a  distribution  for  xw  and 
for  xv\xv.  We  also  need  to  guarantee  that  xv  satisfies  the  power  constraint  at  each 
node  v  in  the  relay  subsquare.  For  each  w  G  W(ri£+ 1),  let 

xw  ~  A/”c(0,  Kn^af2) 


(for  some  K  to  be  chosen  later)  independent  of  x &  for  w  ^  w,  and  let  xv  =  xv  +  zv 
for  zv  ~  Ac(0,  A2)  independent  of  x  and  for  some  A2  >  0.  We  then  have 


h„,hi. 


]Jw 


I  hlt 


~Xyj  “h 


E 

w£W(ne+1)\{w} 


KK 

1 1  h.7, 1 1 


~Xfr  T  h^z  T  Zn 


We  proceed  by  computing  I{xv\xv\{hu,v})  and  I(xw;  yw\{hu,v})  as  required  in 
Lemma  3.8  (the  conditioning  on  { hy^, }  is  again  due  to  availability  of  full  CSI).  Note 
first  that  by  construction  of  ^’^(n^+i),  we  have  for  any  w  G  W{n^+ 1) 


2  min  rVjW  >  max  rv>w, 

v£V(ne+1)  ’  veV(ne+1) 


and  therefore 

\K,W\2  (min  v&v(ne+1)rv,wya  2a 

\\hw\\2  ni+i  ( maxflei/(n<+1)  r V:W)  °  ne+i 

We  start  by  computing  I(xv;  xv\{huj}).  xv  has  mean  zero  and  variance 

E(w2)=  E  M^W/2+a2 

wGW(ne+ 1)  w 

<  ni+ 1 - Knjha^2  +  A2 

ne+i 

^  -1  a/2 

<ne+1a /  , 


(3.21) 


(3.22) 


for 

K  A  2~“(1  -  A2), 
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which  is  positive  for  A2  <  1,  and  where  we  have  used  (3.21)  and  that 

n7+\a“/2  >  2m7 (n)  >  1 

by  (3.12).  Equation  (3.22)  shows  that  xv  satisfies  the  power  constraint  of  node  v  in 
the  relay  subsquare  V(ne+i).  Moreover,  we  obtain 


I (,3'v >  | { ^u,v } )  l~l[xv  |  h  (xv\ Xv,\hu,v\^) 

<  log  ^27reE(|a:^|2)  j  —  log(27reA2) 

<  log  ("A?  )■  (3.23) 

It  remains  to  compute  I(xw',yw\{hutv})-  Note  that  the  encoding  procedure  guar¬ 
antees  that 

cos(7r/4)2||h,w||4  <  \hwhl,\2  <  ||hw||4. 

Moreover,  for  w  ^  w, 

E(\hwhl,\2)  =  E(hwhl,hyjhl) 

=  22  E{\Kw\2\hVw\2) 

veV(nt+ 1) 

=  22  e(im2im2) 

v&V(nt+ 1) 

=  E(\hwhi\2). 

From  this,  we  get  by  a  similar  argument  as  in  Lemma  3.9  that 

I(xw;yw\{hu,v})  >  Kq.  (3.24) 


Using  (3.23)  and  (3.24)  in  Lemma  3.8,  and  observing  that  we  only  communicate 
during  a  fraction 


Pe(n)ne+1a£ 


a/2 
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of  time,  yields  a  per  destination  node  rate  pf c  (n)  arbitrarily  close  to 


K6Pe(n)ni+ia£ 


■a/2 


bits  per  channel  use  and  a  quantizer  rate  arbitrarily  close  to 


log 


-1  a/2 

ne+iae 

A2 


bits  per  encoded  sample.  Since  by  (3.24)  mutual  information  I(xw;  yw\{hu^})  is  at 
least  K6  for  every  w  G  W{ng+ 1)  during  the  fraction  of  time  we  actually  communicate, 
this  implies  that  there  are  at  most  1/K&  encoded  message  samples  for  each  relay 
node  per  rq+ 1  total  message  bits  received  by  the  destination  nodes  W(ng+ 1).  Thus 
the  number  of  bits  required  at  each  relay  node  to  quantize  the  encoded  message 
samples  is  at  most 


<  K7{£  +  1)  log(n) 


bits  per  ng+ 1  total  message  bits  received  by  the  destination  nodes,  and  where  we  have 
used  7 (n)  <  n  by  (3.12).  □ 


3.5  Proof  of  Theorem  3.1 

The  proof  of  Theorem  3.1  is  split  into  two  parts.  In  Section  3.5.1  we  prove  the  theorem 
for  fast  fading,  and  in  Section  3.5.2  for  slow  fading. 

3.5.1  Fast  Fading 

In  this  section,  we  prove  Theorem  3.1  under  fast  fading,  i.e.,  {9UjV[t]}t  is  stationary 
and  ergoclic  in  t.  We  first  prove  that  the  assumptions  on  the  power  constraint  and 
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the  interference  made  in  Section  3.4  (see  Lemmas  3.9  and  3.10)  during  the  analysis  of 
one  level  of  the  hierarchical  relaying  scheme  are  valid.  We  then  use  the  results  proved 
there  to  analyze  the  behavior  of  the  entire  hierarchy,  yielding  a  lower  bound  on  the 
per-node  rate  achievable  with  hierarchical  relaying. 

We  first  argue  that  the  constraint  Pe(n)  <  nj^a^2  needed  in  Lemmas  3.9  and 
3.10  is  satisfied.  Consider  the  hierarchical  relaying  scheme  as  described  in  Section  3.2, 
and  fix  a  level  £,  0  <  £  <  L  =  L(n),  in  this  hierarchy.  At  level  £,  we  have  a  square 
of  area  ae  =  n/y’in)  with  a?  =  n/2ey(n)  source-destination  pairs.  Since  we  are  time 
sharing  between  K22~iry{n)  relay  subsquares  at  this  level,  we  have  an  average  power 
constraint  of 

Pe(n)  =  K^-^in) 

during  the  time  any  particular  relay  subsquare  is  active.  Since  a  >  2  and  since 
n/y~L(n\n)  — >  oo  as  n  — >  oo,  we  have,  for  n  large  enough  (independent  of  £),  that 


Pt(n)  =  Kjr^in) 

^2^(")(y<% 


n  \  a/2—1 

n)  {n) ) 
n  \«/2-i 


-1  a/2 

—  ne+1ae  . 


Therefore  the  power  constraints  in  Lemmas  3.9  and  3.10  are  satisfied. 

We  continue  by  analyzing  the  interference  caused  by  spatial  re-use.  Recall  that  the 
MAC  and  BC  phases  at  level  £  induce  permutation  traffic  within  the  dense  subsquares 
at  level  £  +  1.  The  permutation  traffic  within  those  dense  subsquares  at  level  £  +  1  is 
transmitted  in  parallel  with  spatial  re-use.  We  now  describe  in  detail  how  this  spatial 
re-use  is  performed.  Partition  the  subsquares  of  area  a^+i  (he.,  at  level  £  +  1)  into 
four  subsets  such  that  in  each  subset  all  subsquares  are  at  distance  at  least  y/ae+i 
from  each  other.  The  traffic  that  the  MAC  and  BC  phases  at  level  £  induce  in  each 
of  the  relay  subsquares  at  level  £  +  1  is  transmitted  simultaneously  within  all  relay 
subsquares  in  the  same  subset.  Consider  now  one  such  subset.  We  show  that  at  any 
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relay  subsquare  the  interference  from  other  relay  subsquares  in  the  same  subset  is 
stationary  and  ergodic  within  each  phase,  additive  (i.e.,  independent  of  the  signals 
and  channel  gains  in  this  relay  subsquare),  and  of  bounded  power  N0  —  1  independent 
of  n. 


We  first  argue  that  the  interference  is  stationary  and  ergodic  within  each  phase. 
Note  first  that  on  any  level  £  +  1  in  the  hierarchy,  all  relay  subsquares  are  either 
simultaneously  in  the  MAC  phase  or  simultaneously  in  the  BC  phase.  Furthermore, 
all  relay  subsquares  are  also  synchronized  for  transmissions  within  each  of  these  phases 
(recall  that  the  induced  traffic  in  level  l  +  1  is  uniform  and  is  sent  sequentially  as 
permutation  traffic).  Hence  it  suffices  to  show  that  the  interference  generated  by 
either  the  MAC  or  BC  induced  by  some  permutation  traffic  matrix  is  stationary  and 
ergodic.  Since  all  codebooks  for  either  of  these  cases  are  generated  as  i.i.d.  Gaussian 
multiplied  by  a  Bernoulli  process,  and  in  the  BC  phase  beamformed  for  stationary 
and  ergodic  fading,  this  is  indeed  the  case. 


The  additivity  of  the  interference  follows  easily  for  the  MAC  phase  since  code¬ 
books  are  generated  independently  of  the  channel  realization  in  this  case.  Moreover, 
since  the  channel  gains  are  independent  from  each  other  and  all  codebooks  are  gener¬ 
ated  as  independent  zero  mean  processes,  the  interference  in  the  MAC  phase  is  also 
uncorrelated  (over  space)  within  each  relay  subsquare.  For  the  BC  phase,  the  code¬ 
book  depends  only  on  the  channel  gains  within  each  relay  subsquare  at  level  £  +  1. 
Since  the  channel  gains  within  relay  subsquares  are  independent  of  the  channel  gains 
between  relay  subsquares,  this  interference  is  additive  as  well. 


We  now  bound  the  interference  power.  Note  that  by  the  randomized  time-sharing 
construction  within  the  MAC  and  BC  phases  (see  Lemmas  3.9  and  3.10),  in  each 
relay  subsquare  at  most  rie+i  nodes  transmit  at  an  average  power  of  1.  In  the  MAC 
phase,  all  nodes  use  independently  generated  codebooks  with  power  at  most  1,  and 
thus  the  received  interference  power  from  another  relay  subsquare  at  distance  i^/a^+ i 
is  at  most 


nl+li  aa 


-a/2 

m 


— a2-(*+ 1) 


1— a/2 


<  % 
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by  (3.12).  In  the  BC  phase,  the  nodes  in  each  active  relay  subsquare  use  beamforming 
to  transmit  to  nodes  within  their  own  subsquare.  Since  the  channel  gains  within  a 
relay  subsquare  are  independent  of  the  channel  gains  between  relay  subsquares,  the 
same  calculation  as  in  (3.19)  shows  that  we  can  upper  bound  the  received  interference 
power  from  another  relay  subsquare  at  distance  iy7z7+ 1  by 

ne+ii  ae+ {  <  i  , 


in  the  BC  phase  as  well. 


Now,  by  the  way  in  which  we  perform  spatial  re-use,  every  active  relay  subsquare 
has  at  most  Si  active  relay  subsquares  at  distance  at  least  i^/ae+i-  Hence  the  total 
interference  power  received  at  an  active  relay  subsquare  is  at  most 

OO 

8i2ai~a  4  iV0  -  1  <  cx) 

i= 1 

since  a  >  2.  With  this,  we  have  shown  that  the  interference  term  has  the  properties 
required  in  Lemmas  3.9  and  3.10. 


We  now  apply  those  two  lemmas  to  obtain  a  lower  bound  on  the  rate  achievable 
with  hierarchical  relaying.  Call  T({n)  the  number  of  channel  uses  to  transmit  one 
bit  from  each  of  ri£  source  nodes  to  the  corresponding  destination  nodes  at  level  t. 
Lemma  3.7  states  that  for  n  large  enough  (independent  of  £),  we  relay  over  each 
dense  subsquare  at  most  K32t:  times.  Combining  this  with  Lemma  3.9,  we  see  that 
to  transmit  one  bit  from  each  source  to  its  destination  at  this  level  we  need  at  most 


AK32eK92~ 


1  -1  a/2 

T r,  t  \ 

A4P£(n)  1+1 


K3  2n+3 
Kd 


n“/2-171^(1-Q/2)(n) 


channel  uses  for  the  MAC  phase.  Here,  the  factor  4  accounts  for  the  spatial  re-use, 
K32e  accounts  for  relaying  over  the  same  relay  subsquares  multiple  times,  K22~ir/{n) 
accounts  for  time  sharing  between  the  relay  subsquares,  and  the  last  term  accounts  for 
the  time  required  to  communicate  over  the  MAC.  Similarly,  combining  Lemmas  3.7 
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and  3.10,  we  need  at  most 


^ntt/2— y^1— al2\n) 

Kq 

channel  uses  for  the  BC  phase.  Moreover,  at  level  t  +  1  in  the  hierarchy  this  induces 
a  per-node  traffic  demand  of  at  most  K§  bits  from  the  MAC  phase,  and  at  most 
Kf(£  +  1)  log(n)  from  the  BC  phase.  Thus  we  obtain  the  following  recursion 

re(n)  <  +  -^-)n“/2_17(n)(471"Q/2(n))^  +  (AT5  +  K7(£  +  1)  log(n))rm(n) 

<  iCn“/2_17(n)4£  +  K{£  +  1)  log(n)re+i(n) 

<  iCna/2_1 7(n)4z'(n)  +  KL{n)  log(n)r£+i(n)  (3.25) 

for  positive  constants  K,  K  independent  of  n  and  t. 


At  scale  cil,  we  have  n 7  nodes  and  source-destination  pairs.  Time  sharing  between 
all  source-destination  pairs,  we  have  (during  the  time  we  communicate  for  each  node) 
an  average  power  constraint  of  ul-  Since  at  this  level  we  communicate  over  a  distance 
of  at  most  2 a1/2,  we  have 

TL(n)  <  nL  log"1  f  1  +  — ^Lq/2)  •  (3.26) 

V  2aN0CLaL/2J 

Since 

riLa^a^2  <  =  2”L^  — »  0 

as  n  — >  oo,  we  can  upper  bound  (3.26)  as 

rL(n)  <  K'aaL/2 

=  iC/na/27"L(n)Q/2(n) 

<  A"n"/27"L(n)(n)  (3.27) 


for  some  constant  AT 


84 


Now,  using  the  recursion  (3.25)  L(ri)  times,  and  combining  with  (3.27),  we  obtain 

r0 (n)  <  Kna/2~l 7(n)4L^  +  KL{n)  log(n)ri(n) 

<  . . . 

/  L(n)—1  . 

<  K nat2~lry{n)AL^  f  ^  {KL(n)  log(n))^  j  +  [KL{n)  \og(n))L^ rL(n) 

'  £=0  ' 

<  n"^2_1  ( KL(n )  log(n))i(,l)  4L^7(n)  +  K' n^~L^ {n)^j  .  (3.28) 

Using  the  definition  of  7(71)  and  L(n)  in  (3.11),  we  have  for  n  large  enough 

[KL{n)  log(n))L(n)  <  7l21°g -^-'(nlloglog^ 

4l^7 (n)  <  n21°g_1/2_,5U)+iogi_1/2H; 
n7“L(n)(n)  <  ni°g4_1/2W. 

Since  5  >  0,  the  n10^  1/2U)  term  dominates  in  (3.28),  and  we  obtain 

r0 (n)  <  b(n)na^2~l , 

where 

6(n)  <  n0(1°g'"1/2(n)), 

as  n  — >  00.  Therefore 

p*(n)  >  pHR(n)  =  l/r0(n)  >  &(n)n1_a/2, 

with 

6(n)  >  n-°(log5'1/2(n)), 

concluding  the  proof  for  the  fast  fading  case. 


85 


3.5.2  Slow  Fading 


In  this  section,  we  prove  Theorem  3.1  under  slow  fading,  i.e. ,  {0UjV[t]}t  is  constant  as 
a  function  of  t.  We  sketch  the  necessary  modifications  for  the  scheme  described  in 
Section  3.2  to  achieve  a  per-node  rate  of  at  least  &(n)n1_“/2  in  the  slow  fading  case. 

Consider  level  £,  0  <  £  <  L{n ),  in  the  hierarchy.  Instead  of  relaying  the  message 
of  a  source-destination  pair  over  one  relay  subsquare  as  in  the  scheme  described  in 
Section  3.2,  we  relay  the  message  over  many  dense  subsquares  that  are  at  least  at 
distance  from  both  the  source  and  the  destination  nodes.  We  time  share 

between  the  different  relays.  The  idea  here  is  that  the  wireless  channel  between 
any  node  and  its  relay  subsquare  might  be  in  a  bad  state  due  to  the  slow  fading, 
making  communication  over  this  relay  subsquare  impossible.  Averaged  over  many 
relay  subsquares,  however,  we  get  essentially  the  same  performance  as  in  the  fast 
fading  case. 

We  first  state  a  (somewhat  weaker)  version  of  Lemma  3.7,  appropriate  for  this 
setup.  Consider  again  the  collection  of  schedules  S{n g)  and  S(n g)  satisfying  the  con¬ 
ditions  that  no  relay  subsquare  is  selected  by  more  than  ng+\  source-destination  pairs 
and  that  all  sources  and  destinations  are  at  least  at  distance  ^/2a^+i  from  their  relay 
subsquare  (see  Section  3.4.1  for  the  formal  definition).  The  next  lemma  shows  that  for 
each  source-destination  pair,  we  can  find  K22~e'~1^{n)  distinct  relay  subsquares  sat¬ 
isfying  the  above  conditions  (the  requirement  that  these  relay  subsquares  are  distinct 
is  expressed  by  the  orthogonality  condition  of  the  schedules  in  Lemma  3.11  below). 

Lemma  3.11.  For  every  n  large  enough  (independent  of  £)  and  every  permutation 
traffic  matrix  Auc(rq)  e  {0,l}neXne  there  are  schedules  {S^\ng)}^f\  7  ^  c  S(ne)> 
{S^(ng)}^  7  (n)  C  S(ng)  satisfying 

A'22-V(A 

AUCM  =  — E  S®(ne)S®(ne), 

1  '  '  i=  1 

where  {S^\ng)}i,  {S^\ng)}i  are  collections  of  orthogonal  matrices  in  the  sense  that 


for  i  7^  i' 


u,k^u,k 


o, 


u,k 


o. 


k,u 


(3.29) 


Proof.  The  proof  is  similar  to  that  of  Lemma  3.7.  In  order  to  construct  {S'W  (n^)}  and 
consider  the  sequential  pass  over  all  n  source-destination  pairs  (assume  n 
is  large  enough  for  Lemma  3.7  to  hold).  As  before,  for  each  source-destination  pair, 
there  are  K22~e~lry(n)  dense  relay  subsquares  that  are  at  distance  at  least  y/2ag+i. 
Each  pair  chooses  all  of  these  K22~e~1'y(n)  subsquares  instead  of  just  one  as  before. 
Stop  one  round  of  this  procedure  as  soon  as  any  of  the  relay  subsquares  is  chosen  by 
ri£+ 1  pairs.  Since  by  the  end  of  one  round  at  least  one  relay  subsquare  is  matched  by 
ri£+ 1  source-destination  pairs,  there  are  at  most  n^/n^+i  =  2y (n)  such  rounds. 

Consider  now  the  result  of  one  such  round.  We  construct  K22~l~1^{n)  matrices 
S^\nf)  and  with  the  i-th  pair  of  matrices  describing  communication  over  the 

i-th  relay  subsquares  chosen  by  source-destination  pairs  matched  in  this  round.  Thus, 
this  process  produces  a  total  of  2'y(n)K22~e~1'y(n)  =  K22~l^2fn)  such  matrices.  The 
orthogonality  property  follows  since  each  source-destination  pair  relays  over  the  same 
relay  subsquare  at  most  once.  □ 


Having  decomposed  the  scaled  traffic  matrix  K22~i~1'y(n)Xuc(n)  into  K22~irf1(n) 
matrices,  each  source-destination  pair  tries  to  relay  over  K22~e~1'y(n)  dense  sub¬ 
squares.  We  time  share  between  these  relay  subsquares.  Since  each  source-destination 
pair  relays  only  a  (K22~e~lry(n))~1  fraction  of  traffic  over  any  of  its  relay  subsquares, 
the  loss  due  to  this  time  sharing  is  now 


K22-l^[n) 

Jl22-^-17(n) 


27  (n) 


as  opposed  to  A'32£  in  Lemma  3.7.  In  other  words,  the  loss  is  at  most  a  factor  27 (n) 
more  than  in  Lemma  3.7.  Using  the  definition  of  7 (n)  in  (3.11),  we  have 


7(71)  <  n  105,5  1/2 ^  <  b  1(n). 


87 


In  other  words,  this  additional  loss  is  small. 


Consider  now  a  specific  relay  subsquare.  If  a  source-destination  pair  can  commu¬ 
nicate  over  this  relay  subsquare  at  a  rate  at  least  1/64-th  of  the  rate  achievable  in  the 
fast  fading  case  (given  by  Lemmas  3.9  and  3.10),  it  sends  information  over  this  relay. 
Otherwise  it  stays  silent  during  the  period  of  time  it  is  assigned  this  relay.  We  now 
show  that,  with  probability  1  —  o(l)  as  n  — >  oo,  for  every  source-destination  pair  on 
every  level  of  the  hierarchy  at  least  one  quarter  of  its  relay  subsquares  can  support 
this  rate.  As  we  only  communicate  over  a  quarter  of  the  relay  subsquares,  this  implies 
that  we  can  achieve  at  least  1/256-th  of  the  per- node  rate  for  the  fast  fading  case  (see 
Section  3.5.1),  i.e.,  that  b{n)nl~a /2  is  achievable  with  probability  1  —  o(l)  as  n  — >  oo. 


Assume  we  have  for  each  source-destination  pair  (u,  w)  picked  K22~£~lry(n)  dense 
subsquares  over  which  it  can  relay;  call  those  relay  subsquares  {AU:Wtk}k^l  Y<n^. 
Consider  the  event  Bu^wk  that  source  node  u  can  communicate  at  the  desired  rate 
to  destination  node  w  over  relay  subsquares  Auwj.  (assuming,  as  before,  that  we  can 
solve  the  communication  problem  within  this  subsquare). 


Let  { B^w  k}f=1  be  the  events  that  the  interference  clue  to  matched  filtering  in  the 
MAC  phase,  the  interference  from  spatial  re-use  in  the  MAC  phase,  the  interference 


due  to  beamforming  in  the  BC  phase,  and  the  interference  from  spatial  re-use  in  the 
BC  phase,  are  less  than  8  times  the  one  for  fast  fading,  respectively.  From  the  proof 
of  Lemmas  3.9,  3.10,  and  of  Theorem  3.1  for  the  fast  fading  case  in  Section3.5.1,  we 


see  that 


n 


A®)  T3 

Du,w,k  Du,w,k- 


/  1 

Due  to  spatial  re-use,  multiple  relay  subsquares  will  be  active  in  parallel.  Let  H 
denote  the  set  of  channel  gains  between  active  relay  subsquares.  Using  essentially  the 
same  arguments  as  for  the  fast  fading  case  (see  Lemmas  3.9,  3.10,  and  Section  3.5.1) 
and  from  Markov’s  inequality,  we  have  ¥(B^W  k\H)  >7/8  for  all  i  e  {1, ...  ,4}  and 
hence  ¥(BU:Wjk\H)  >  1/2. 


We  now  argue  that  the  events 


nLi  B 


i(i)  K22-t-iy(n) 


(3.30) 


are  independent  conditioned  on  H,  by  showing  that  these  events  depend  on  disjoint 
sets  of  channel  gains  and  codebooks.  Assuming  the  codebooks  are  generated  new 
for  each  communication  round,  then  they  are  all  independent.  Thus  we  only  have 
to  consider  the  dependence  on  the  channel  gains.  Let  Uk  and  144  be  the  source  and 
destination  nodes  communicating  over  relay  subsquare  AUtW^  in  round  k,  and  let  14 
be  the  nodes  in  AUjWtk.  Let  £4,  144  be  the  source  and  destination  nodes  that  are 
communicating  at  the  same  time  as  (u,  w )  due  to  spatial  re-use.  Let  14  be  the  relay 
nodes  of  Uk  and  444-  Now,  B^wk  and  B^wk  depend  (for  fixed  H)  on  the  channel 
gains  between  Uk  and  14-  Bu  wk  depends  on  the  channel  gains  between  14  and  144- 
Buw  k  depends  (again  for  fixed  H)  on  the  channel  gains  between  14  and  144-  Since 
these  sets  are  disjoint  for  different  k  by  the  orthogonality  of  the  schedules  (see  (3.29)), 
conditional  independence  of  the  events  in  (3.30)  follows. 


To  summarize,  conditioned  on  the  channel  gains  H  between  active  relay  sub¬ 
squares,  the  random  variables  {1 Buwk}k  are  independent  and  have  expected  value 
E(!iwl*0  -  V2-  The  sum 

A'22-«-17(n) 

V  1B  k 

/  j  J-'lL,W,K 

k=  1 

is  the  number  of  relay  subsquares  over  which  the  source-destination  pair  (u,  w )  suc¬ 
cessfully  relays  traffic.  We  now  show  that  with  high  probability  at  least  one  quarter  of 
these  relay  subsquares  allow  successful  transmission.  Indeed,  by  the  Chernoff  bound, 


PlEtls.,..*  <  K22-‘-3-y(n) 


H 


—  E {Y2k^-Bu,w,k  <  -^22  7(n)P(SU)U))fe 

<  exp  (  -  2K2~i'y(n)F(BUtW,k\H)) 

<  exp  (  -  K2~t'y(n)) 
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for  some  constant  K  >  0.  Since  the  right-hand  side  is  the  same  for  all  H ,  this  implies 


<  K,2-‘S(n))  <  exp  (  -  I( Tci(n)). 


In  each  of  the  L(ri)  levels  of  the  hierarchy  there  are  at  most  n 2  source-destination 
pairs,  and  hence  by  the  union  bound  with  probability  at  least 

1  —  L[n)n 2  exp  (  —  K2~L^'y(n))r 

for  every  source-destination  pair  on  every  level  of  the  hierarchy  at  least  one  quarter 
of  its  relay  subsquares  can  support  the  desired  rate.  By  the  choices  of  7 (n)  and  L(n) 
in  (3.11),  this  probability  is  at  least 

1  -  L(n)n2  exp  (  -  K2~L^1{n))  >  1  -  n3exp  (  -  A^-L(n)2log(n)/2L(n)) 

>  1  —  exp  ^A'2loglog(n)  —  A'2^logl/2+'5('n^_Iogl/2_'5^n-)  j 

>  1  -  exp  (  -  2nl,0«‘,2+<l”») 

>l-o(l) 

as  n  — >  00,  and  for  some  constant  K.  This  proves  that  the  same  order  rate  as  in  the 
fast  fading  case  can  be  achieved  with  high  probability  for  all  levels  0  <  £  <  L{n). 

It  remains  to  argue  that  the  same  holds  for  level  t  =  L(n).  Note  that  since  we 
assume  phase  fading  only,  the  received  signal  power  is  only  a  function  of  distance  and 
not  of  the  fading  realization.  Since  at  level  L(n)  we  use  simple  time  sharing,  this 
implies  that  we  can  always  achieve  the  same  rate  at  level  L(n)  as  in  the  fast  fading 
case. 

Hence  with  probability  1  —  o(l)  as  n  — >  00,  we  achieve  the  same  order  rate  at  each 
level  0  <  t  <  L(n)  as  for  fast  fading,  proving  Theorem  3.1  for  the  slow  fading  case. 
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3.6  Proof  of  Theorem  3.2 


Here,  we  provide  a  generalization  and  sharpening  of  the  converse  in  [38].  Most  of  the 
arguments  follow  [38,  Theorem  5.2],  We  start  by  proving  a  lemma  upper  bounding 
the  MIMO  capacity. 

Consider  two  subsets  Si,  62  C  V (n)  such  that  Si  fl  S2  =  0.  Assume  we  allow 
the  nodes  within  Si  and  S2  to  cooperate  without  any  restriction.  The  maximum 
achievable  sum  rate  between  the  nodes  in  Si  and  S2  is  given  by  the  MIMO  capacity 
C(Si,S2)  between  them.  The  next  lemma  upper  bounds  C(Si,S2)  in  terms  of  the 
node  distances  between  the  two  sets  and  the  normalized  channel  gains 


h  — 

IV1I.V  - 


1 1  I 


E 


r~a 
v€S2  u,v 


(3.31) 


Lemma  3.12.  Under  either  fast  or  slow  fading,  for  every  a  >  2,  for  any  node 
placement  V{n),  and  any  Si,S2  C  V{n)  with  Si  0  S2  =  0,  we  have 

C (Si,  S2)  <  4  T  max  1 1,  max  ew})eec 

'  ueSi  '  ueSi  veS2 


Proof.  Let 


H  =  {hu>v} 


u£Sl,VGS2  1 


H  =  {hu,v} 


u£Sl,VGS2  5 


be  the  matrix  of  (normalized)  channel  gains  between  the  nodes  in  Si  and  S2.  Consider 
first  fast  fading.  Under  this  assumption,  we  have 


C(S1,S2) 


max  E  log  det  (I  +  H]Q(H)H ) 
Q(H)>  0:  V  V  7 

E(gu,u)<i  V«eS  1 


Dehne 

PSi,S2  =  rn' 

uGSi  vGS2 
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as  the  total  received  power  in  S2  from  Si,  and  set 


Pu,S2  =  P{u},S2 


with  slight  abuse  of  notation.  Then 


C{SU  S2)  =  QmaXQ  E  ^  logdet  (J  +  H^Q(H)H) 

E(<?u,u)<Pu,s2Vue'S'i 

<  max  Ej  logdet  (I  +  H]Q(H)H ) 

Q{H)>  0:  V  V 

E(trQ(i/))<PSl,S2 


(3.32) 


Define  the  event 

B  =  {\\H\\2  >  b } 

for  some  b  and  where  ||JT||  denotes  the  largest  singular  value  of  H.  In  words,  B  is 
the  event  that  the  channel  gains  between  S\  and  S2  are  “good”.  We  argue  that,  for 
appropriately  chosen  b ,  the  event  B  has  probability  zero  (i.e.,  the  channel  cannot  be 
too  “good”).  By  Markov’s  inequality 

P (B)  <  b~mE(\\H\\2m),  (3.33) 

for  any  m.  We  continue  by  upper  bounding  E(||iT||2m).  We  have 

||ff||2fc  <  tr 


for  any  k,  and  hence 


E(||iT||2m)  <  E^(tr ({HH')k))m/ky 


(3.34) 


Now,  for  any  k  >  m ,  we  have  by  Jensen’s  inequality 


E^(tr((iTiTt)fc))m/fc^  <  ^Etr((iTiTt)fc)^m/fc.  (3.35) 
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Combining  (3.33),  (3.34),  and  (3.35)  yields 


P (B)  <  b~m  ( Etr((i?i/t)fc) 


m/k 


(3.36) 


for  any  k  >  m. 

Now,  the  arguments  in  [38,  Lemma  5.3]  show  that 


E(tr  ((ifift)*))  <  tk  n  (  max  1 1,  max  Ei'1 


u£Si 


where  tk  is  the  k-th  Catalan  number.  Combining  with  (3.36),  this  yields 


P (B)  <  ^6  1t]jkn1^k  f  max  |l,  m^x  ^  |h.U;tl|2  j) 


uES  i 


1/k 

Taking  the  limit  as  k  — >  oo  and  using  that  — »■  4  yields 


P (B)  <  (  b  x4(  max  <  1,  max  \  h 
V  l  v£S2  '  J 

u£Si 


Assume 


b  >  4  j  max  1 ,  max  EI'V»I  i  .. 

uGSi 


then  taking  the  limit  as  m  — >  oo  shows  that 


P  (B)  =  0. 


Using  this,  we  can  upper  bound  (3.32)  as 


(3.37) 


C(S1,S2)<  max  E^tr  \H^Q(H)Hj 

E(tr  Q(H))<Ps1,s2 


=  max  E(  l^ctrl  H^Q(H)H 

Q(H)>  0:  V  V 

E(tr  Q(H))<PSl,s2 

<  max  E^l^c  ||iT||2tr(5(iT) 

Q(H)>  0:  V 

E(trQ(H))<PSljS2 
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<  bPSl,s2- 

Since  this  is  true  for  all  b  satisfying  (3.37),  we  obtain  the  lemma  for  the  fast  fading 
case. 

Under  slow  fading 


C(Si,  S2)  =  max  log  det  (I  +  H^QH) , 
qu,u<P  VueSi 

and  the  lemma  can  be  obtained  by  the  same  steps.  □ 

We  now  proceed  to  the  proof  of  Theorem  3.2.  Consider  a  vertical  cut  dividing  the 
network  into  two  parts.  By  the  minimum-separation  requirement,  an  area  of  size  o{n ) 
can  contain  at  most  o{n )  nodes,  and  hence  we  can  find  a  cut  such  that  each  part  is 
of  size  0(n)  and  contains  O(n)  nodes.  Call  the  left  part  of  the  cut  S.  Since  there  are 
0(n)  nodes  in  S  and  in  Sc,  there  are  O(n)  sources  in  S  with  their  destination  in  Sc 
with  probability  1  —  o(l).  For  technical  reasons  we  add  a  node  inside  each  square  in 
V{n)  of  the  form  [id,  (i  +  1  )d]  x  [jd,  (j  +  l)d\  for  some  i,j  e  N,  where  d  =  a/2  log(n). 
These  additional  nodes  have  no  traffic  demands  on  their  own,  and  simply  help  with 
the  transmission.  This  can  clearly  only  increase  achievable  rates.  Moreover,  this 
increases  the  number  of  nodes  in  V  by  less  than  a  factor  2.  We  now  show  that 

C(S ,  Sc)  =  O  ( log6(n)n2-Q/2) ,  (3.38) 

and  hence  by  the  cut-set  bound  [9,  Theorem  14.10.1],  and  since  there  are  O(n)  sources 
in  S  with  their  destination  in  Sc,  we  have 

p*(n)  =  0(log6(n)n1_a/2). 

We  prove  (3.38)  using  Lemma  3.12.  To  this  end,  we  need  to  upper  bound 


94 


The  proof  of  [38,  Lemma  5.3]  shows  that  if 

1.  there  are  less  than  log(n)  nodes  inside  [i,i  +  1]  x  \j,j  +  1]  for  any  i,j  e 
{0,  ...,y/n-  1}, 

2.  there  is  at  least  one  node  inside  [id,  (■ i  +  l)rf]  x  [jd,  ( j  +  l)d]  for  any  i,j,  where 
d  =  y/2  log  n, 

then 

max  'y^j\hUyV\2  <  K  log3 (n) ,  (3.39) 

u^S 

and  for  aG  (2,3] 

^  r"“  <  K  log 3(n)n2-“/2,  (3.40) 

for  constants  K,  K.  For  arbitrary  node  placement  with  minimum  separation,  the  first 
requirement  is  satisfied  for  n  large  enough,  since  only  a  constant  number  of  nodes 
can  be  contained  in  each  area  of  constant  size.  By  our  addition  of  nodes  into  V (■ n ) 
described  above,  the  second  condition  is  also  satisfied.  Using  Lemma  3.12  with  (3.39) 
and  (3.40)  yields  (3.38),  concluding  the  proof  of  Theorem  3.2. 

3.7  Proof  of  Theorem  3.3 

Consider  a  node  placement  with  n/2  nodes  located  uniformly  on  [0,  y/n/ 4]  x  [0,  y/ri] 
and  n/2  nodes  located  on  [y/n/ 2,  y/n\  x  [0,  y/n]  with  minimum  separation  rmin  =  1/2. 
A  random  traffic  matrix  Auc(n)  is  such  that  at  least  n/4  communication  pairs  have 
their  sources  in  the  left  cluster  and  destinations  in  the  right  cluster  with  probability 
1  —  o(l).  Assume  we  are  dealing  with  such  a  Auc(n)  in  the  following. 

In  this  setup,  with  multi-hop  at  least  one  hop  has  to  cross  the  gap  between  the 
left  and  the  right  cluster.  Thus,  even  without  any  interference  from  other  nodes,  we 
can  obtain  at  most 

pMH(n)  <  4 an~a/2. 

Moreover,  considering  a  cut  between  the  two  clusters  (say,  S  and  Sc ),  and  applying 
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Lemma  3.12  yields  that 


p*(n)  <  16 n  *(  max  1 1 ,  max  |  hU)V  j 

VG  u£S  /  u£S  v£Sc 


EE 


(3.41) 


Now  note  that  for  any  u  G  S',  v  £  Sc,  we  have 


<  rUjV  <  2  y/n. 


Hence 


El '*«.«! 2  =  E 

u£S  u£S  ^6 Sc  u,v 


and 

E  E  ^ 

ueS  veSc 

Combining  this  with  (3.41)  yields 


p*(n )  <  22+5an1~a/2 


for  all  a  >  2. 


3.8  Proof  of  Theorem  3.4 

We  construct  a  cooperative  multi-hop  communication  scheme  and  lower  bound  the 
per- node  rate  pCMH(n)  it  achieves.  We  use  the  hierarchical  relaying  scheme  as  build¬ 
ing  block.  Assume  the  node  placement  V(n)  is  //-regular  at  resolution  d(n)  for  all 
n  >  1.  We  show  that  this  implies  that  we  can  achieve  a  per-node  rate  of  at  least 
d3~a (n)?/-1/2-^71)  as  n  — >  oo.  Taking  the  smallest  such  d(n)  then  yields  the  result. 

We  consider  three  cases  for  the  value  of  d(n)  (namely,  d(n)  =  Q(y/n),  d(n)  >  n°^\ 
and  d(n)  <  n°^).  First,  if  d(n)  =  Q(y/n)  as  n  — >  oo  then  the  result  follows  directly 
from  Theorem  3.1.  Considering  a  subsequence  if  necessary,  we  can  therefore  assume 
without  loss  of  generality  that  d(n)  =  o(y/n)  in  the  following. 
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Second,  consider  d(n)  satisfying 


d(n)  >  n^log5"1/2(n).  (3.42) 

Divide  A(n)  into  squares  of  sidelength  d(n).  Since  d(n)  =  o(y/n),  the  number  of 
such  squares  grows  unbounded  as  n  ->  oo.  We  now  show  that  we  can  use  multi¬ 
hop  communication  with  a  hop  length  of  d(n)  where  each  hops  is  implemented  by 
squares  cooperatively  sending  information  to  a  neighboring  square.  In  other  words,  we 
perform  cooperative  communication  at  local  scale  d(n)  and  multi-hop  communication 
at  global  scale  \Jn. 

Since  V(n)  is  /^-regular  at  resolution  d(n),  each  such  square  contains  at  least 
fid2(n)  nodes.  Pick  the  top  left  most  square  and  construct  the  square  of  sidelength 
2 d(n)  consisting  of  it  together  with  its  3  neighbors.  Continue  in  the  same  fashion, 
partitioning  all  of  A(n)  into  squares  of  sidelength  2 d(n).  Note  that  each  such  bigger 
square  contains  at  least  4 /j,d2(n)  nodes,  and  we  assume  this  worst  case  in  the  following. 
Partition  A{n)  into  4  subsets  of  those  bigger  squares  such  that  within  each  such  subset 
each  square  is  at  distance  at  least  2 d(n)  from  any  other  square  (see  Figure  3-6).  We 
time  share  between  those  4  subsets.  Consider  in  the  following  one  such  subset.  For 
every  bigger  square,  we  construct  two  permutation  traffic  matrices  AiC(4yud2(n))  and 
\^c{A/id2(n)).  In  the  nodes  in  the  top  two  squares  have  as  destinations  the 
nodes  in  the  bottom  two  squares  and  the  nodes  in  the  bottom  two  squares  have  as 
destinations  the  nodes  in  the  top  two  squares  (see  Figure  3-6).  Similarly,  A^0  contains 
communication  pairs  between  left  and  right  squares.  We  time  share  between  A^c  and 

a2uc- 

Communication  according  to  Atuc  within  bigger  squares  in  the  same  subset  occurs 
simultaneously.  We  are  going  to  use  hierarchical  relaying  within  each  bigger  square. 
This  is  possible  since  each  such  square  contains  at  least  4/id2(n)  nodes.  We  have  to 
show  that  the  additional  interference  from  bigger  squares  in  the  same  subset  is  such 
that  Theorem  3.1  still  applies.  In  particular,  we  need  to  show  that  the  interference 
has  bounded  power,  say  K .  Using  the  same  arguments  as  in  the  proof  of  Theorem  3.1 
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Figure  3-6:  Sketch  of  the  construction  of  the  cooperative  multi-hop  scheme  in  the 
proof  of  Theorem  3.4.  The  dashed  squares  have  sidelength  d(n).  The  gray  area  is 
one  of  the  4  subsets  of  bigger  squares  that  communicate  simultaneously.  The  arrows 
indicate  the  traffic  matrix  A^c. 

in  Section  3.5  yields  that  this  is  indeed  the  case  (the  interference  from  other  bigger 
squares  here  behaves  the  same  way  as  the  interference  due  to  spatial  re-use  from  other 
active  relay  subsquares  there).  With  this,  we  are  now  dealing  with  a  hierarchical 
relaying  scheme  with  area  4 d2(n),  4 jud2(n)  nodes,  and  additive  noise  with  power 
1  +  K .  Both  the  lower  number  of  nodes  and  the  higher  noise  power  will  decrease  the 
achievable  per- node  rate  by  only  some  constant  factor,  and  hence  Theorem  3.1  shows 
that  under  fast  fading  we  can  achieve  a  per-node  rate  of  at  least 

61(d2(n))(d2(n))1-“/2  >  bi{n)d2~a{n), 

as  n  — >  oo,  where 

6i(n)  >n-°(logi_1/2(n)). 

Moreover,  the  same  rate  is  achievable  under  slow  fading  with  probability  1  —  &2(d2(n)), 
where 

b2(n)  <exp(^-2n(logl/2+5(n))). 

The  setup  is  the  same  for  all  bigger  squares  within  each  of  the  4  subsets. 
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We  now  “shift”  the  way  we  defined  the  bigger  squares  by  d(n)  to  the  right  and 
to  the  bottom.  With  this,  each  new  bigger  square  intersects  with  4  bigger  squares 
as  defined  before.  We  use  the  same  communication  scheme  within  these  new  bigger 
squares  and  time  share  between  the  two  ways  of  defining  bigger  squares. 

Construct  now  a  graph  where  each  vertex  corresponds  to  a  square  of  sidelength 
d(n)  and  where  two  vertices  are  connected  by  an  edge  if  they  are  adjacent  in  either 
the  same  old  or  new  bigger  square.  This  graph  is  depicted  in  Figure  3-4  in  Section 
3.3. 

With  the  above  construction,  we  can  communicate  along  each  edge  of  this  graph 
simultaneously  at  a  per-node  rate  of 


bi(n) 

16 


d 


2— a 


(n) 


in  the  fast  fading  case.  In  the  slow  fading  case,  this  statement  holds  with  probability 
at  least 


1  -  4mb2(An))  ^~4mexp(-  2n(logW<J>l,)) 

>  1  —  exp  ^/C2logloghd  _  2^los1/2+6(d(n))^ 
for  constants  K',K.  By  assumption  (3.42), 

log 1/2+5  (d(n))  >  (^-^log1/2+<5(n))  '  > 

and  hence 

as  n  — »  oo,  showing  that  with  high  probability  we  achieve  the  same  order  rate  under 
slow  fading  as  under  fast  fading. 

The  graph  constructed  forms  a  grid  with  n/d2(n)  nodes.  Using  that  each  bigger 
square  can  contain  at  most  Kid2(n)  nodes  by  the  minimum-separation  requirement, 
standard  arguments  for  routing  over  grid  graphs  (see  [30])  show  that  in  the  fast  fading 
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case  we  can  achieve  a  per-node  rate  of 


pCMH(n)  >  b(n)d2  >  b(n)d3  a(n)n  1^2, 

y/n 

where 

b(n)  =n-°(log''V2(n)). 

Moreover,  the  same  statement  holds  in  the  slow  fading  case  with  probability  1  —  o(l). 
Finally,  consider  d(n)  such  that 


d(n)  <  n2+ 


^log5-1/2H 


(3.43) 


Construct  the  same  graph  as  before,  but  this  time  we  use  simple  multi-hop  commu¬ 
nication  between  adjacent  squares  of  sidelcngth  d(n).  By  time  sharing  between  the 
at  most  Kid2(n )  nodes  in  each  square,  and  since  we  communicate  over  a  distance  of 
at  most  3 d(n),  we  achieve  under  either  fast  of  slow  fading  a  per-node  rate  between 
the  squares  of  at  least 

K"  d~2~a(n)  >  A'"n-log5'1/2(n) 

for  some  constant  K",  and  where  we  have  used  (3.43).  Using  the  analysis  of  grid 
graphs  as  before,  we  can  achieve  a  per-node  rate  of  at  least 

pCMH(n)  >  K"n-logS-1/2(n)^=L  >  b(n)d3~a(n)n-1/2, 

Vn 

for  either  the  fast  or  slow  fading  case. 


3.9  Proof  of  Theorem  3.5 

Consider  V (n)  with  nj 2  nodes  located  uniformly  on  [0,  (y/n  —  d*(n))/2]  x  [0,  y/n\  and 
n/2  nodes  located  uniformly  on  [yfn/ 2,  y/n\  x  [0,  y/n\  such  that  rmin  =  1/2.  This  node 
placement  is  1/2-regular  at  resolution  d*(n).  A  random  traffic  matrix  A uc(n)  is  such 
that  Q(n)  communication  pairs  have  their  sources  in  the  left  cluster  and  destinations 
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in  the  right  cluster  with  probability  1  —  o(l).  Assume  we  are  dealing  with  such  a 
Auc(n)  in  the  following. 

Considering  a  cut  between  the  two  clusters  and  applying  Lemma  3.12  (slightly 
adapting  the  arguments  in  Section  3.6),  yields  that 

p*(n)  =  0(  log6(n)d*3-a(n)n-1/2) 


for  a  >  3. 


3.10  Discussion 

We  briefly  discuss  several  aspects  of  the  proposed  hierarchical  relaying  scheme.  Sec¬ 
tion  3.10.1  comments  on  the  full  CS1  assumption  and  Section  3.10.2  on  the  use  of 
bursty  communication.  Sections  3.10.3  and  3.10.4  outline  how  the  results  obtained 
here  can  be  extended  to  the  case  of  dense  networks  and  networks  without  minimum 
separation  between  nodes.  Section  3.10.5  compares  our  hierarchical  relaying  scheme 
to  the  hierarchical  cooperation  scheme  presented  in  [38].  Section  3.10.6  discusses 
design  guidelines. 


3.10.1  Full  CSI  Assumption 

Throughout  our  analysis,  we  have  made  a  full  CSI  assumption.  In  other  words,  we 
assumed  that  the  phase  shifts  [0lhV[t] }uv  are  available  at  time  t  at  all  nodes  in  the 
network.  As  was  pointed  out  in  Section  2.2,  a  2-bit  quantization  of  the  channel  state 
{&u,v[t]}u,v  available  at  all  nodes  at  time  t  is  sufficient  to  obtain  the  same  scaling 
behavior.  This  follows  by  an  argument  similar  to  the  one  used  in  the  analysis  of  the 
BC  phase  in  Section  3.4.3,  where  it  is  shown  that  beamforming  using  a  quantized 
channel  state  results  only  in  a  constant  factor  rate  loss. 
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3.10.2  Burstiness  of  Hierarchical  Relaying  Scheme 

The  hierarchical  relaying  scheme  presented  here  is  bursty  in  the  sense  that  nodes 
communicate  at  high  power  during  a  small  fraction  of  time.  This  leads  to  high  peak- 
to-average  power  ratio,  which  is  undesirable  in  practice.  We  chose  burstiness  in  the 
time  domain  to  simplify  the  exposition.  The  same  bursty  behavior  could  be  achieved 
in  a  more  practical  manner  by  using  CDMA  with  several  orthogonal  signatures  or  by 
using  OFDM  with  many  sub-carriers.  Each  approach  leads  to  many  parallel  channels 
out  of  which  only  few  are  used  with  higher  power.  This  avoids  the  issue  of  burstiness 
in  the  time  domain. 

3.10.3  Dense  Networks 

Throughout  this  chapter,  we  have  only  considered  extended  networks,  i.e,  n  nodes 
placed  on  a  square  region  of  area  n  with  a  minimum  separation  of  ru/0  >  rmin.  The 
results  can,  however,  be  recast  for  dense  networks,  where  n  nodes  are  arbitrarily 
placed  on  a  square  region  of  unit  area  with  a  minimum  separation  of  ru,v  >  rmin/ y/n. 
It  suffices  to  notice  that  by  rescaling  power  by  a  factor  n~a /2  a  dense  network  can 
essentially  be  transformed  into  an  extended  network  with  path-loss  exponent  a  (see 
also  [38]).  Hence  the  same  result  for  dense  networks  can  be  obtained  from  the  result 
for  extended  networks  by  considering  the  limit  a  — >  2.  Applying  this  to  Theorem  3.1, 
yields  a  linear  per-node  rate  scaling  of  the  hierarchical  relaying  scheme. 

3.10.4  Minimum-Separation  Requirement 

The  minimum-separation  requirement  rmin  G  (0, 1)  on  the  node  placement  is  sufficient 
but  not  necessary  for  Theorem  3.1  to  hold.  A  weaker  sufficient  condition  is  that  a 
constant  fraction  of  subsquares  are  dense,  as  shown  in  Lemma  3.6  to  be  a  consequence 
of  the  minimum-separation  requirement.  It  is  straightforward  to  show  that  this  weaker 
condition  is  satisfied  with  high  probability  for  nodes  placed  uniformly  at  random  on 
[0,  y/n\2-  In  fact,  it  can  be  shown  that  for  a  random  node  placement  all  subsquares  at 
all  levels  0  <  t  <  L{n )  are  dense  with  high  probability.  This  yields  a  different  proof 
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of  Theorem  5.1  in  [38]. 


3.10.5  Comparison  with  Prior  Work 

Both,  the  hierarchical  relaying  scheme  presented  here  and  the  hierarchical  scheme 
presented  in  [38],  share  that  they  use  virtual  multiple-antenna  communication  and 
a  hierarchical  architecture  to  achieve  essentially  global  cooperation  in  the  network. 
The  schemes  differ,  however,  in  several  key  aspects,  which  we  point  out  here. 

First,  we  note  that  we  obtain  a  slightly  better  scaling  law.  Namely 

b1{n)n1~a^2  <  p*{n)  <  &2(n)n1-"/2 


with 

h(n)  >n-°(logi"1/2(n)), 

(n)  =  0(log6(n)), 

for  any  5  G  (0, 1/2)  obtained  here,  compared  to 

6i(n)n1-Q/2  <  p*(n )  <  &2(n)n1_a/2 


with 


bi(n)  =  f l(n  £), 
b2(n)  =  0(n£), 

for  any  £  >  0  in  [38].  For  the  lower  bound  (i.e.,  achievability) ,  this  is  because  the 
hierarchy  here  is  not  of  fixed  depth  L  as  in  [38],  but  rather  of  depth  L{n)  =  log1/^2_<5(n) 
(for  some  constant  5  €  (0,1/2)),  i.e.,  changing  with  n.  For  the  upper  bound  (i.e., 
converse),  this  is  clue  to  a  sharpening  of  the  arguments  in  [38]. 

Second,  note  that  the  multi-user  decoding  at  the  relay  subsquares  during  the 
MAC  phase  and  the  multi-user  encoding  during  the  BC  phase  are  very  simple  in  our 
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setup.  In  fact,  using  matched  filter  receivers  and  transmit  beamforming,  we  convert 
the  multi-user  encoding  and  decoding  problems  into  several  single-user  decoding  and 
encoding  problems.  This  differs  from  the  approach  in  [38],  in  which  joint  decoding 
of  a  number  of  users  on  the  order  of  the  network  size  is  performed.  Our  results  thus 
imply  that  these  simpler  transmitter  and  receiver  structures  provide  the  same  scaling 
as  the  more  complicated  joint  decoding  in  [38].  We  note  that  the  scheme  proposed 
in  [38]  can  be  modified  to  also  use  matched  filter  receivers  as  suggested  here. 

Third,  and  probably  most  important,  the  schemes  differ  in  how  they  achieve  the 
throughput  gain  from  using  multiple  antennas.  In  [38],  the  nodes  are  located  almost 
regularly  with  high  probability.  This  allowed  the  use  of  a  scheme  in  which  a  source 
subsquare  directly  communicates  with  a  destination  subsquare.  In  other  words,  the 
multiple-antenna  gain  comes  from  setting  up  a  virtual  MIMO  channel  between  the 
source  and  the  destination.  In  our  setup,  the  arbitrary  location  of  nodes  prevents  such 
an  approach.  Instead,  we  use  that  at  least  some  fixed  fraction  of  subsquares  is  almost 
regular  (we  called  them  dense  subsquares).  Source-destination  pairs  relay  their  traffic 
over  such  a  dense  subsquare.  In  other  words,  the  multiple-antenna  gain  comes  from 
setting  up  a  virtual  multiple-antenna  MAC  and  BC.  Thus,  the  hierarchical  relaying 
scheme  presented  here  shows  that  considerably  less  structure  on  the  node  locations 
than  assumed  in  [38]  suffices  to  achieve  a  multiple-antenna  gain  essentially  on  the 
order  of  the  network  size.  Note  also  that  the  additional  degree  of  freedom  offered  by 
the  choice  of  relay  subsquare  for  a  given  source-destination  pair  makes  it  possible  to 
extend  the  result  to  hold  also  for  slow  fading  channels. 

3.10.6  Design  Guidelines 

The  results  presented  in  this  chapter  suggest  the  following  design  guidelines  for  com¬ 
munication  schemes  for  large  wireless  networks.  First,  in  the  low  path-loss  regime, 
cooperative  communication  is  necessary,  and  can  be  achieved  regardless  of  the  reg¬ 
ularity  of  the  node  placement.  This  cooperative  communication  is  implemented  by 
finding  regions  in  the  wireless  network  that  contain  many  nodes,  and  in  which  a 
hierarchical  scheme  can  be  used. 
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Second,  in  the  high  path-loss  regime,  multi-hop  communication  should  be  used 
whenever  the  node  placement  is  regular  enough  for  this  to  be  possible.  However, 
for  less  regular  networks,  the  use  of  more  complicated  cooperative  communication 
schemes  can  be  necessary  for  optimal  operation  of  the  network.  This  is  due  to  large 
gaps  or  irregularities  in  the  node  placement  that  make  the  use  of  multi-hop  commu¬ 
nication  inefficient. 


3.11  Chapter  Summary 

We  considered  the  problem  of  the  scaling  of  achievable  rates  in  arbitrary  extended 
wireless  networks.  We  generalized  the  hierarchical  cooperative  communication  scheme 
presented  in  [38]  for  a  fast  fading  channel  model  and  with  random  node  placements. 
We  proposed  a  different  hierarchical  cooperative  communication  scheme,  which  also 
works  for  arbitrary  node  placement  (with  a  minimum-separation  requirement)  and 
for  either  fast  or  slow  fading. 

For  small  path-loss  exponent  a  E  (2,3],  we  showed  that  our  scheme  is  order 
optimal  and  achieves  the  same  rate  irrespective  of  the  node  placement.  In  particular, 
this  rate  is  equal  to  the  one  achievable  under  random  node  placement.  In  other 
words,  the  regularity  of  the  node  placement  has  no  impact  on  achievable  rates  for 
small  path-loss  exponent. 

The  situation  is,  however,  quite  different  for  large  path-loss  exponent  a  >  3. 
We  argued  that  in  this  regime  the  regularity  of  the  node  placement  directly  impacts 
the  scaling  of  achievable  rates.  We  then  presented  a  cooperative  communication 
scheme  that  smoothly  “interpolates”  between  multi-hop  and  hierarchical  cooperative 
communication  depending  on  the  regularity  of  the  node  placement.  We  showed  that 
this  scheme  is  order  optimal  for  all  a  >  3  under  adversarial  node  placement  with 
regularity  constraint.  This  contrasts  with  the  situation  for  more  regular  networks 
(like  the  ones  obtained  with  high  probability  through  random  node  placement),  in 
which  multi-hop  communication  is  order  optimal  for  all  a  >  3. 
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Chapter  4 


Traffic  Heterogeneity 


In  this  chapter,  we  analyze  the  scaling  of  the  n2-dimensional  unicast  capacity  region 
Auc(n)  of  a  wireless  network  of  n  randomly  placed  nodes  under  a  Gaussian  fading 
channel  model. 

As  a  first  result  of  this  chapter,  we  present  an  inner  and  an  outer  bound  on  the 
unicast  capacity  region.  These  bounds  coincide  in  the  scaling  sense  along  at  least  n2  — 
n  out  of  n  dimensions  (corresponding  to  balanced  traffic)  in  the  low  path-loss  regime 
a  G  (2,  5]  and  along  all  n2  dimensions  in  the  high  path-loss  regime  a  >  5.  These 
inner  and  outer  bounds  approximate  the  unicast  capacity  region  by  a  polytope  with 
less  than  2 n  faces,  each  corresponding  to  a  distinct  cut  (i.e.,  a  subset  of  nodes)  in  the 
wireless  network.  This  polyhedral  characterization  provides  a  succinct  approximate 
description  of  the  unicast  capacity  region  even  for  large  values  of  n.  Moreover,  it 
shows  that  for  balanced  traffic  or  a  >  5  only  2 n  out  of  2”  possible  cuts  in  the 
wireless  network  are  asymptotically  relevant  and  reveals  the  geometric  structure  of 
these  relevant  cuts. 

Second,  we  establish  the  approximate  equivalence  of  the  wireless  network  and  a 
wireline  tree  graph,  in  the  sense  that  traffic  can  be  transmitted  reliably  over  the  wire¬ 
less  channel  if  and  only  if  approximately  the  same  traffic  can  be  routed  over  the  tree 
graph.  This  equivalence  is  the  key  component  in  the  derivation  of  the  approximation 
result  for  the  unicast  capacity  region  and  provides  insight  into  the  structure  of  large 
wireless  networks. 
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Third,  we  propose  a  novel  three-layer  communication  architecture  that  achieves 
(in  the  scaling  sense)  the  entire  unicast  capacity  region.  The  top  layer  of  this  scheme 
treats  the  wireless  network  as  the  aforementioned  tree  graph  and  routes  messages  be¬ 
tween  sources  and  their  destinations  —  dealing  with  heterogeneous  traffic  demands. 
The  middle  layer  of  this  scheme  provides  this  tree  abstraction  to  the  top  layer  by  ap¬ 
propriately  distributing  and  concentrating  traffic  over  the  wireless  network  —  choos¬ 
ing  the  level  of  cooperation  in  the  network.  The  bottom  layer  implements  this  distri¬ 
bution  and  concentration  of  messages  in  the  wireless  network  —  dealing  with  interfer¬ 
ence  and  noise.  The  approximate  optimality  of  this  three-layer  architecture  implies 
that  a  separation  based  approach,  in  which  routing  is  performed  independently  of  the 
physical  layer,  is  order-optimal  for  balanced  traffic  or  in  the  high  path-loss  regime 
a  >  5. 

4.0.1  Organization 

The  remainder  of  this  chapter  is  organized  as  follows.  Section  4.1  presents  the  main 
results  of  this  chapter.  We  illustrate  the  strength  of  these  results  in  Section  4.2  by 
analyzing  various  example  scenarios  with  heterogeneous  unicast  traffic  patterns  for 
which  no  scaling  results  were  previously  known.  Section  4.3  provides  a  high-level 
description  of  the  proposed  communication  scheme.  Sections  4. 4-4. 6  contain  proofs. 
Finally,  Sections  4.7  and  4.8  contain  discussions  and  concluding  remarks. 


4.1  Main  Results 

In  this  section,  we  present  the  main  results  of  this  chapter.  In  Section  4.1.1,  we  provide 
inner  and  outer  bounds  on  the  n2-dimensional  unicast  capacity  region  A uc(n)  of  the 
wireless  network,  resulting  in  a  scaling  characterization  for  either  most  (for  a  G  (2,  5]) 
or  all  (for  a  >  5)  dimensions  of  Auc(n).  In  Section  4.1.2,  we  discuss  implications  of 
these  results  on  the  behavior  of  the  unicast  capacity  region  for  large  values  of  n.  In 
Section  4.1.3,  we  consider  computational  aspects. 
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4.1.1  Unicast  Capacity  Region 


Here  we  present  an  inner  and  outer  bound  on  the  n2-dimensional  unicast  capacity 
region  Auc(n).  We  show  that  these  bounds  are  tight  (in  the  scaling  sense)  along 
n2  —  n  out  of  the  total  n 2  dimensions  for  a  G  (2,  5),  and  are  tight  (again  in  the  scaling 
sense)  along  all  n2  dimensions  for  a  >  5. 


Our  approximate  characterization  of  the  unicast  capacity  region  will  be  given  in 
terms  of  the  total  traffic  across  various  regions  in  the  network.  To  this  end,  we 
introduce  some  notation.  Partition  A(n)  into  several  square-grids.  The  Oth  square- 
grid  divides  A{n)  into  4£  squares,  each  of  sidelength  denoted  by  {A^n)}^. 

Let  V^i{n)  C  V(n)  be  the  nodes  in  A^n)  (see  Figure  4-1).  The  square  grids  in  levels 
t  G  {1, . . . ,  L(n)}  with 

L(n)  =  ^  log (n)  (1  -  log- 1/2 (n)) ,  (4.1) 

will  be  of  particular  importance.  Note  that  L{n )  is  chosen  such  that 

4-h(n)n  =  nlog-1/2(n)> 


and  hence 

lim  I^L(n),i(n)l  =  lim  4 ~L{n)'n  =  00- 

n— >oo  v  n  n^oo 

while  at  the  same  time 

I^Z(n),»l  =  4'L(n)n<no(1), 

as  n  — >  oo.  In  other  words,  the  area  of  the  region  at  level  i  =  L(n )  grows 

to  infinity  as  n  — >  oo,  but  much  slower  than  n. 
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Figure  4-1:  Square-grids  with  0  <  £  <  2.  The  grid  at  level  t  =  0  is  the  area  A{n) 
itself.  The  grid  at  level  t  =  1  is  indicated  by  the  dashed  lines.  The  grid  at  level 
t  =  2  by  the  dashed  and  the  dotted  lines.  Assume  for  the  sake  of  example  that  the 
subsquares  are  numbered  from  left  to  right  and  then  from  bottom  to  top  (the  precise 
order  of  numbering  is  immaterial).  Then  Vo^(n)  are  all  the  nodes  V(n ),  V\t\(n)  are 
the  nine  nodes  in  the  lower  left  corner  (separated  by  dashed  lines),  and  V^^n)  are 
the  three  nodes  in  the  lower  left  corner  (separated  by  dotted  lines). 


We  are  now  ready  to  define  the  approximate  unicast  capacity  region.  Let 


E  E  (Ah  +  AS  < 

ue\ Ei(n)  vtve,i(,n) 


W  G  {1, . . . ,  L(n)},i  G  {1, . . . ,  4^}, 


A  1 

VJ -U 

Vu  G  V  ( n )  | , 


(4.2) 


and 


E  R 


nxn 

+ 


Auv  ^  (4_^n)2_min{3’"}/2 

«eV£,i(n)  v£Veti(n) 


W  e  {1, . . . ,  L(n)},i  G  {1, . . . ,  4^}, 


-  1 

V^U 

VueV  (n)  | . 
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A^c(n)  and  A2C(n)  are  the  collection  of  all  unicast  traffic  matrices  Auc  such  that  for 
various  cuts  S  C  V ( n )  in  the  network,  the  total  traffic  demand  (in  either  one  or  both 
directions) 


u£S  v^S 

E  E(AS  + 

u£S  v£S 

across  the  cut  S  is  not  too  big.  Note  that  the  number  of  cuts  S  we  need  to  consider  is 
actually  quite  small.  In  fact,  there  are  at  most  n  sets  {V^n)}^  for  i  e  {1 , ,L(n)}. 
Hence  A)JC(n)  and  A^n)  are  described  by  2 n  cuts. 

The  next  theorem  shows  that  A^c(n)  is  an  approximate  (in  the  scaling  sense) 
inner  bound  and  A^n)  is  an  approximate  outer  bound  to  the  unicast  capacity  region 
Auc(n)  of  the  wireless  network. 

Theorem  4.1.  Under  either  fast  or  slow  fading,  for  any  a  >  2,  there  exist 

hi  (■ n )  >  n~o(^ , 

h2  (n)  =  0(log6(n)), 

such  that 

6i(n)A)JC(n)  C  Auc(n)  C  b2(;n)A2C(n), 
with  probability  1  —  o(l)  as  n  — >  oo. 

We  point  out  that  Theorem  4.1  holds  only  with  probability  1  —  o(l)  for  different 
reasons  for  the  fast  and  slow  fading  cases.  Under  fast  fading,  the  theorem  holds  only 
for  node  placements  that  are  “regular  enough” .  The  node  placement  itself  is  random, 
and  we  show  that  the  required  regularity  property  is  satisfied  with  high  probability  as 
n  — >  oo.  Under  slow  fading,  the  theorem  holds  under  the  same  regularity  requirements 
on  the  node  placement,  but  now  it  also  only  holds  with  high  probability  for  the 
realization  of  the  fading  {9u,v}u,v 

Comparing  the  expression  for  A^c(n)  and  A2C(n),  we  see  that  whenever  a  traffic 
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matrix  Auc  satisfies 


E  E  (-C + «)  <  n*‘>  E  E  (4.3) 

«el ^,i(n)  v£Viti(n)  MS Ve,i(ra)  v£Vtti(n) 

for  all  i  G  {1, . . . ,  L(n)}  and  i  G  {1, . .  .4/},  then  Auc  G  A^n)  implies  n_°^1^Auc  G 
A^c(n),  and  hence  for  such  traffic  matrices  the  inner  and  outer  bounds  in  Theorem 
4.1  coincide  in  the  scaling  sense.  In  particular,  this  applies  for  traffic  matrices  Auc 
such  that  (4.3)  holds  with  equality,  and  we  call  such  traffic  (approximately)  balanced 
in  the  following.  Note  that  the  condition  of  balanced  traffic  imposes  (less  than)  n 
linear  constraints  on  Auc,  and  hence  Theorem  4.1  provides  the  correct  scaling  of  the 
n2- dimensional  unicast  capacity  region  Auc(n)  along  at  least  n 2  —  n  dimensions. 

In  the  high  path- loss  exponent  regime  (a  >  5),  it  can  be  shown  that  A^c(n)  is 
also  an  approximate  outer  bound. 

Theorem  4.2.  Under  either  fast  or  slow  fading,  for  any  a  >  5,  there  exists 

h(n)  =  0(log6(n)), 


such  that 

Auc(n)  C  b3(n) A^c(n), 
with  probability  1  —  o(l)  as  n  — >  oo. 

Combining  Theorems  4.1  and  4.2  provides  a  tight  scaling  characterization  for 
a  >  5  of  the  entire  unicast  capacity  region  Auc(n)  of  the  wireless  network  as  depicted 
in  Figure  4-2.  The  approximations  in  both  theorems  are  within  a  factor  n±0W.  This 
factor  can  be  sharpened  as  is  discussed  in  detail  in  Section  4.7.2. 

4.1.2  Implications  of  Theorem  4.1  and  4.2 

Theorems  4.1  and  4.2  can  be  applied  in  two  ways.  First,  the  theorems  can  be  used  to 
analyze  the  asymptotic  achievability  of  a  sequence  of  traffic  matrices.  Let  {Auc(n)}n>i 
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Figure  4-2:  For  a  >  5,  the  set  A^c(n)  approximates  the  unicast  capacity  region  Auc(n) 
of  the  wireless  network  in  the  sense  that  6i(n)A^c(n)  (with  b\(n)  >  n~°^)  provides 
an  inner  bound  to  Auc(n)  and  bs(n)A^c(n)  (with  63(n)  =  0(log6(n)))  provides  an 
outer  bound  to  Auc(n).  The  figure  shows  two  dimensions  (namely  and  -^21)  °f 
the  n2-dimensional  set  Auc(n).  The  same  approximation  result  holds  for  a  G  (2,5] 
along  at  least  n 2  —  n  out  of  n2  total  dimensions. 


be  a  sequence  of  unicast  traffic  matrices  with  Auc(n)  G  M"xn.  Dehne 

P*\uc(n)  =  sup{p  :  pAuc(n)  G  Auc(n)}, 

P\vc(n)  =  sup{p  :  pAuc(n)  G  A^c(n)}, 


i.e.,  p*uc  (ri)  is  the  largest  multiplier  p  such  that  the  scaled  traffic  matrix  pAuc(n)  is 
contained  in  Auc(n)  (and  similar  for  p*uc  (n)  with  respect  to  A^c(n)).  Then  Theorems 
4.1  and  4.2  provide  asymptotic  information  about  the  achievability  of  {Auc(n)}n>i  in 
the  sense  that  if  either  the  Auc(n)  are  balanced  or  if  a  >  5  then1 


lim 

n— kx) 


log(n) 


lim 

n— >  00 


log(pluc(n)) 

log(n) 


Second,  Theorems  4.1  and  4.2  provide  information  about  the  shape  of  the  unicast 
capacity  region  Auc(n).  We  now  argue  that  even  though  the  approximation  A'jJC(n) 
of  Auc(n)  is  only  up  to  n°^  scaling,  its  shape  is  largely  preserved. 


1We  assume  here  that  the  limits  exist,  otherwise  the  same  statement  holds  for  lim  sup  and  lim  inf. 
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To  illustrate  this  point,  consider  a  rectangle 


R(n)  =  [0, Ti(n)]  x  [0,r2(n)j, 


and  let 

R(n)  =  [0,  f i(n)]  x  [0,f2(n)], 

where 

fi  =  bi(n)ri(n ) 

for  some  bi(n)  =  n±0^\  be  its  approximation.  The  shape  of  R(n )  is  then  determined 
by  the  ratio  between  ri(n)  and  r2(n).  For  example,  assume  ri(n)  =  ?hV2(n).  Then 

fiM  =  nw)  =  n^ii!iM 

r2(n)  r2(n)  ’ 


i.e., 


lim 


log  (r i(n)/r 2(n)) 
log(n) 


=  f3  —  lim 


log  (f \{n)/f 2 (n)) 
log(n) 


and  hence  the  approximation  i?(n)  preserves  the  exponent  of  the  ratio  of  sidelengths 
of  R(n).  In  other  words,  if  the  two  sidelengths  ri(n)  and  r2(n)  differ  on  exponential 
scale  (i.e.,  by  a  factor  nd  for  /3  ^  0)  then  this  shape  information  is  preserved  by  the 
approximation  R{n). 


Let  us  now  return  to  the  unicast  capacity  region  Auc(n)  and  its  approximation 
A^c(n).  We  consider  several  boundary  points  of  Auc(n)  and  show  that  their  behavior 
varies  at  scale  'nP  for  different  values  of  (5.  From  the  discussion  in  the  previous 
paragraph,  this  implies  that  a  significant  part  of  the  shape  of  Auc(n)  is  preserved 
by  its  approximation  A^c(n).  First,  let  Auc  =  pin)!  for  some  scalar  pin)  depending 
only  on  n,  and  where  1  is  the  n  x  n  matrix  of  all  ones.  If  Auc  G  Auc(n)  then  the 
largest  achievable  value  of  p(n)  is  p*(n)  <  n-mm{3>a}/2+°(1)  (by  applying  Theorem 
4.1).  Second,  let  Auc  such  that  A^£w*  =  p(n)  for  only  one  source-destination  pair 
(■ u*,w *)  with  u*  p  w*  and  A^  =  0  otherwise.  Then  p*(n),  the  largest  achievable 
value  of  p(n),  satisfies  p*(n)  >  n~°^  (again  by  Theorem  4.1).  Hence  the  boundary 
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points  of  Auc(n)  vary  at  least  from  n  min{3>Q}/2+°(1)  to  n  and  this  variation  on 
exponential  scale  is  preserved  by  A^c(n). 

4.1.3  Computational  Aspects 

Since  we  are  interested  in  large  wireless  networks,  computational  aspects  are  of  im¬ 
portance.  In  this  section,  we  show  that  the  region  AljJC(n)  can  be  efficiently  described. 
By  Theorems  4.1  and  4.2,  this  provides  a  computationally  efficient  approximate  de¬ 
scription  of  the  entire  unicast  capacity  region  Auc(n)  for  a  >  5  and  of  n2  —  n  of  its 
n2  dimensions  for  a  G  (2,5]. 

Note  that  Auc(n)  is  a  n2-dimensional  set,  and  hence  its  shape  could  be  rather 
complex.  In  particular,  in  the  special  cases  where  the  capacity  region  is  known,  its 
description  is  often  in  terms  of  cut-set  bounds.  Since  there  are  2”  possible  subsets 
of  n  nodes,  there  are  2n  possible  cut-set  bounds  to  be  considered.  In  other  words, 
the  description  complexity  of  Auc(n)  is  likely  to  be  growing  exponentially  in  n.  On 
the  other  hand,  as  was  pointed  out  in  Section  4.1.1,  the  description  of  A^c(n)  is  in 
terms  of  only  2 n  cuts.  This  implies  that  A^c(n)  can  be  computed  efficiently  (i.e.,  in 
polynomial  time  in  n) .  Hence  even  though  the  description  complexity  of  Auc(n)  is 
likely  to  be  of  order  @(2n),  the  description  complexity  of  its  approximation  A'[JC(n) 
is  only  of  order  0(rt)  —  an  exponential  reduction.  In  particular,  consider  a  unicast 
traffic  matrix  Auc  and  assume  that  either  a  >  5  or  that  Auc  is  balanced,  then  this 
implies  that  membership  Auc  G  Auc(n)  (and  hence  by  Theorems  4.1  and  4.2  also 
the  approximate  achievability  of  the  unicast  traffic  matrix  Auc)  can  be  computed  in 
polynomial  time  in  the  network  size  n.  More  precisely,  evaluating  each  of  the  O(n) 
cuts  takes  at  most  0(n2)  operations,  yielding  a  Q(n3)-time  algorithm  for  approximate 
testing  of  membership  in  Auc(n). 

4.2  Example  Scenarios 

We  next  illustrate  the  strength  of  the  above  results  by  determining  achievable  rates 
in  a  few  specific  wireless  network  scenarios  with  non-uniform  traffic  patterns.  We 
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illustrate  the  impact  on  achievable  rates  of  various  sources  of  traffic  heterogeneities 
—  variation  of  distance  between  source-destination  pairs,  variation  of  amount  of  traffic 
between  different  pairs,  sources  with  multiple  destinations. 

Example  4.1.  Multiple  classes  of  source-destination  pairs 

There  are  K  classes  of  source-destination  pairs,  for  some  fixed  K .  Each  source 
node  in  class  %  generates  traffic  at  the  same  rate  pt(n)  for  a  destination  node  that 
is  chosen  randomly  within  distance  ©(n^/2),  for  some  fixed  (3i  G  [0,1].  Each  node 
randomly  picks  the  class  it  belongs  to.  The  resulting  traffic  matrix  is  approximately 
balanced  with  high  probability,  and  applying  Theorem  4.1  shows  that  p*(n ),  the 
largest  achievable  value  of  Pi(n),  satisfies 

p*(n)  =  nft(l-a/2)±o(l), 

with  probability  1  —  o(l)  for  all  i,  and  where 

a  =  min{3,  a}. 


Hence,  for  a  fixed  number  of  classes  K,  source  nodes  in  each  class  can  obtain  rates 
as  a  function  of  only  the  source-destination  separation  in  that  class. 

Set  hi  =  nPl ,  and  note  that  h*  is  on  the  order  of  the  expected  number  of  nodes 
that  are  closer  to  a  source  than  its  destination.  Then 


(  \  zbo(l)  ~  1— Ck/2 

n)  =  n  y  ’n. 


Now  h\  a is  precisely  the  per-node  rate  that  is  achievable  for  an  extended  network 
with  hi  nodes  under  random  source-destination  pairing  [38] .  In  other  words,  the  local 
traffic  pattern  here  allows  us  to  obtain  a  rate  that  is  as  good  as  the  one  achievable 
under  random  source-destination  pairing  for  a  much  smaller  network.  0 

Example  4.2.  Traffic  variation  with  source- destination  separation 

Assume  each  node  is  source  for  exactly  one  destination  chosen  uniformly  at  random 
from  among  all  the  other  nodes  (as  in  the  traditional  setting).  However,  instead  of  all 
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sources  generating  traffic  at  the  same  rate,  source  node  u  generates  traffic  at  a  rate 
that  is  a  function  of  its  separation  from  destination  w,  i.e.,  the  traffic  matrix  is  given 
by  rU)W )  for  some  function  ip.  In  particular,  let  us  consider 

{  r13  if  r  >  1, 

1  else 

for  some  fixed  (3  G  M  and  some  p(n)  depending  only  on  n.  The  traditional  setting 
corresponds  to  (3  —  0,  in  which  case  all  n  source-destination  pairs  communicate  at 
uniform  rate.  Such  traffic  is  approximately  balanced  with  high  probability,  and  using 
Theorem  4.1  establishes  the  scaling  of  p*(n),  the  largest  achievable  value  of  p(n),  as 

f  ni-(a+/?)/2±o(i)  if  p  >  2  -  a, 

P*(n)  =  \ 

I  n^l1)  else, 

with  probability  1  —  o(l).  For  (3  =  0,  and  noting  that  2  —  a  <  0,  this  recovers  the 
results  from  [38]  for  random  source-destination  pairing  with  uniform  rate.  0 

Example  4.3.  Source- destination  separation  variation 

Each  node  generates  traffic  at  the  same  rate  p(n).  For  each  source  u  we  pick  one 
destination  w(u)  independently  at  random  at  distance  s  with  density  ip(s),  i.e.,  for 
all  re  [0,  y/n\ 

W{ru,w<p)  <  r\V(n))  «  [  ip(s)ds,  (4.4) 

Z[n)  Js=1 

for 

ip(r)  =  r'3, 

for  some  hxed  (3  G  M,  and  for  normalization  constant 

rs/n 

Z(n )  =  /  ,ip(s)ds 
Js= 1 

(the  relation  is  only  approximate  since  the  number  of  nodes  is  finite).  Note  that  the 
node  placement  V(n)  in  (4.4)  is  hxed,  and  hence  so  are  ru>v  for  all  pairs  u,v  €  V(n). 
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The  randomness  in  (4.4)  is  due  to  the  random  choice  of  destination  w(u)  for  source 
u.  Note  also  that  the  traditional  setup  of  choosing  destinations  uniformly  at  random 
from  among  all  other  nodes  corresponds  essentially  to  (3  —  1.  Finally,  note  that  this 
traffic  is  approximately  balanced  with  high  probability.  The  scaling  of  p*(n),  the 
largest  achievable  p(n),  is  thus  given  by  Theorem  4.1  as 


nl— <5/2±o(l) 

if  P  >  -1, 

n(l— 5— 0)/2±o(l) 

if  1  —  a  <  /3 

n±°(!) 

else, 

with  probability  1  —  o(l).  For  (3  —  1  this  coincides  again  with  the  results  from  [38] 
for  random  source-destination  pairing  with  uniform  rate.  0 


Example  4.4.  Sources  with  multiple  destinations 

All  the  example  scenarios  so  far  are  concerned  with  traffic  in  which  each  node  is 
source  exactly  once.  Here  we  consider  more  general  traffic  patterns.  There  are  K 
classes  of  source  nodes,  for  some  fixed  K .  Each  source  node  in  class  i  has  0 (rrl ) 
destination  nodes  for  some  fixed  $  G  [0, 1]  and  generates  independent  traffic  at  the 
same  rate  Pi(n)  for  each  of  them  (i.e.,  we  still  consider  unicast  traffic).  Each  of  these 
destination  nodes  is  chosen  uniformly  at  random  among  the  n  —  1  other  nodes.  Every 
node  randomly  picks  the  class  it  belongs  to.  Noting  that  the  resulting  traffic  matrix 
is  approximately  balanced  with  high  probability,  Theorem  4.1  provides  the  following 
scaling  of  the  rates  achievable  by  different  classes: 

p*  (n)  =  n1-ft-«/2±°(1)) 

with  probability  1  —  o(l)  for  all  i.  In  other  words,  for  each  source  node  time  sharing 
between  all  K  classes  and  then  (within  each  class)  between  all  its  0 (rd3* )  destination 
nodes  is  order-optimal  in  this  scenario.  However,  different  sources  are  operating 
simultaneously.  0 
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4.3  Communication  Scheme  for  Unicast  Traffic 


In  this  section,  we  provide  a  high-level  description  of  the  communication  scheme  used 
to  prove  achievability  (i.e.,  the  inner  bound)  in  Theorem  4.1.  This  scheme  has  a  tree 
structure,  that  makes  it  convenient  to  work  with.  This  tree  structure  is  crucial  in 
proving  the  compact  approximation  of  the  unicast  capacity  region  Auc(n)  in  Theorems 
4.1  and  4.2.  The  communication  scheme  for  general  unicast  traffic  uses  as  a  building 
block  the  multi-hop  and  hierarchical  relaying  schemes  for  random  source-destination 
pairing  with  uniform  rate  discussed  in  Chapter  3. 

The  communication  scheme  consists  of  three  layers:  A  top  or  routing  layer,  a 
middle  or  cooperation  layer,  and  a  bottom  or  physical  layer.  The  routing  layer  of  this 
scheme  treats  the  wireless  network  as  a  tree  graph  G  and  routes  messages  between 
sources  and  their  destinations  —  dealing  with  heterogeneous  traffic  demands.  The 
cooperation  layer  of  this  scheme  provides  this  tree  abstraction  G  to  the  top  layer 
by  appropriately  distributing  and  concentrating  traffic  over  the  wireless  network  - 
choosing  the  level  of  cooperation  in  the  network.  The  physical  layer  implements  this 
distribution  and  concentration  of  messages  in  the  wireless  network  —  dealing  with 
interference  and  noise. 

Seen  from  the  routing  layer,  the  network  consists  of  a  noiseless  capacitated2  graph 
G.  This  graph  is  a  tree,  whose  leaf  nodes  are  the  nodes  V ( n )  in  the  wireless  network. 
The  internal  nodes  of  G  represent  larger  clusters  of  nodes  (i.e.,  subsets  of  V(n))  in 
the  wireless  network.  More  precisely,  each  internal  node  in  G  represents  a  set  (n) 
for  £  E  {1, . . . ,  L(n )}  and  i  e  (1, . . . ,  4£}.  Consider  two  sets  V^j(n),  Vg+ij(n)  and  let 
u,  fi  be  the  corresponding  internal  nodes  in  G.  Then  v  and  //  are  connected  by  an 
edge  in  G  if  V^+l  j  (n)  C  Vt^n).  Similarly,  for  U(«)  i(n)  and  corresponding  internal 
node  v  in  G,  a  leaf  node  u  in  G  is  connected  by  an  edge  to  v  if  u  G  I4.,i(n)  (recall  that 
the  leaf  nodes  of  G  are  the  nodes  V(n)  in  the  wireless  network).  This  construction 
is  shown  in  Figure  4-3.  In  the  routing  layer,  messages  are  sent  from  each  source  to 
its  destination  by  routing  them  over  G.  To  send  information  along  an  edge  of  G,  the 

2  A  graph  G  =  ( Vg,Eq )  is  capacitated  if  every  edge  e  €  Eg  is  associated  with  a  nonnegative 
capacity  ce. 
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Figure  4-3:  Construction  of  the  tree  graph  G.  We  consider  the  same  nodes  as  in  Figure 
4-1  with  L{n )  =  2.  The  leaves  of  G  are  the  nodes  V(n)  of  the  wireless  network.  They 
are  always  at  level  £  =  L(n)  +  1  (i.e.,  3  in  this  example).  At  level  0  <  l  <  L(n)  in  G, 
there  are  4£  nodes.  The  tree  structure  is  the  one  induced  by  the  grid  decomposition 
as  shown  in  Figure  4-1.  Level  0  contains  the  root  node  of  G. 

routing  layer  calls  upon  the  cooperation  layer. 

The  cooperation  layer  implements  the  tree  abstraction  G.  This  is  done  by  ensuring 
that  whenever  a  message  is  located  at  a  node  in  G,  it  es  evenly  distributed  over  the 
corresponding  cluster  in  the  wireless  network,  i.e.,  every  node  in  the  cluster  has  access 
to  a  distinct  part  of  equal  length  of  the  message.  To  send  information  from  a  child 
node  to  its  parent  in  G  (i.e.,  towards  the  root  node  of  G),  the  message  at  the  cluster 
in  V (n)  represented  by  the  child  node  is  distributed  evenly  among  all  nodes  in  the 
bigger  cluster  in  V ( n )  represented  by  the  parent  node.  More  precisely,  let  v  be  a  child 
node  of  //  in  G,  and  let  V^+i^n),  Vij(n)  the  corresponding  subsets  of  V(n).  Consider 
the  cooperation  layer  being  called  by  the  routing  layer  to  send  a  message  from  v  to 
its  parent  fi  over  G.  In  the  wireless  network,  we  assume  each  node  in  V^+\^{n)  has 
access  to  a  distinct  l/|W+i,i(n)|  fraction  of  the  message  to  be  sent.  Each  node  in 
Vg+i }i(n)  splits  its  message  part  into  four  distinct  parts  of  equal  length.  It  keeps  one 
part  for  itself  and  sends  the  other  three  parts  to  three  nodes  in  V^-(n)  \  W+mC n). 
After  each  node  in  W+m (n)  has  sent  their  message  parts,  each  node  in  Vej(n)  now 
as  access  to  a  distinct  l/|V^j(n)|  fraction  of  the  message.  To  send  information  from 
a  parent  node  to  a  child  node  in  G  (i.e.,  away  from  the  root  node  of  G),  the  message 
at  the  cluster  in  V (n)  represented  by  the  parent  node  is  concentrated  on  the  cluster 
in  V{n)  represented  by  the  child  node.  More  precisely,  consider  the  same  nodes  v 
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and  [i  in  G  corresponding  to  U+i ,i(n)  and  Vgj(n)  in  V(n).  Consider  the  cooperation 
layer  being  called  by  the  routing  layer  to  send  a  message  from  p  to  its  child  u.  In  the 
wireless  network,  we  assume  each  node  in  Vgj(n)  has  access  to  a  distinct  \/\Vp^(n)\ 
fraction  of  the  message  to  be  sent.  Each  node  in  Vgj(n)  sends  its  message  part  to 
another  node  in  Vg+iti(n).  After  each  node  in  Vgj(n)  has  sent  their  message  part,  each 
node  in  Vg+\ ti(n)  now  as  access  to  a  distinct  l/\Vg+\^{n)\  fraction  of  the  message.  To 
implement  this  distribution  and  concentration  of  messages,  the  cooperation  layer  calls 
upon  the  physical  layer. 

The  physical  layer  performs  the  distribution  and  concentration  of  messages.  Note 
that  the  traffic  induced  by  the  cooperation  layer  in  the  physical  layer  is  very  uni¬ 
form,  and  closely  resembles  a  permutation  traffic  (in  which  each  node  in  the  wireless 
network  is  source  and  destination  once  and  all  these  source-destination  pairs  want  to 
communicate  at  equal  rate).  Hence  we  can  use  either  cooperative  communication  (for 
a  G  (2,  3])  or  multi-hop  communication  (for  a  >  3)  for  the  transmission  of  this  traffic. 
See  Chapter  3  for  a  detailed  description  of  these  two  schemes.  It  is  this  operation  in 
the  physical  layer  that  determines  the  edge  capacities  of  the  graph  G  as  seen  from 
the  routing  layer. 

The  operation  of  this  three-layer  architecture  is  illustrated  in  the  following  exam¬ 
ple. 

Example  4.5.  Consider  a  single  source-destination  pair  (u,v).  The  corresponding 
operation  of  the  three- layer  architecture  is  depicted  in  Figure  4-4. 

In  the  routing  layer,  the  message  is  routed  over  the  tree  graph  G  between  u  and  w 
(indicated  in  black  in  the  figure).  The  middle  plane  in  the  figure  shows  the  induced 
behavior  from  using  the  second  edge  along  this  path  (indicated  in  solid  black  in  the 
figure)  in  the  cooperation  layer.  The  bottom  plane  in  the  figure  shows  (part  of)  the 
corresponding  actions  induced  in  the  physical  layer.  Let  us  now  consider  the  specific 
operations  of  the  three  layers  for  the  single  message  between  u  and  v.  Since  G  is  a 
tree,  there  is  a  unique  path  between  u  and  v,  and  the  routing  layer  sends  the  message 
over  the  edges  along  this  path.  Consider  now  the  first  such  edge.  Using  this  edge  in 
the  routing  layer  induces  the  following  actions  in  the  cooperation  layer.  The  node  u, 
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Figure  4-4:  Example  operation  of  the  three-layer  architecture  under  unicast  traffic. 
The  three  layers  depicted  are  (from  top  to  bottom  in  the  figure)  the  routing  layer, 
the  cooperation  layer,  and  the  physical  layer. 
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having  access  to  the  entire  message,  splits  that  message  into  3  distinct  parts  of  equal 
length.  It  keeps  one  part,  and  sends  the  other  two  parts  to  the  two  other  nodes  in 
V24 (n)  (i.e.,  lower  left  square  at  level  l  —  2  in  the  hierarchy).  In  other  words,  after  the 
message  has  traversed  the  edge  between  u  and  its  parent  node  in  the  routing  layer,  all 
nodes  in  V24 (n)  in  the  cooperation  layer  have  access  to  a  distinct  1/3  fraction  of  the 
original  message.  The  edges  in  the  routing  layer  leading  up  the  tree  (i.e.,  towards  the 
root  node)  are  implemented  in  the  cooperation  layer  in  a  similar  fashion  by  further 
distributing  the  message  over  the  wireless  network.  By  the  time  the  message  reaches 
the  root  node  of  G  in  the  routing  layer,  the  cooperation  layer  has  distributed  the 
message  over  the  entire  network  and  every  node  in  V (n)  has  access  to  a  distinct  1  jn 
fraction  of  the  original  message.  Communication  down  the  tree  in  the  routing  layer  is 
implemented  in  the  cooperation  layer  by  concentrating  messages  over  smaller  regions 
in  the  wireless  network.  To  physically  perform  this  distribution  and  concentration 
of  messages,  the  cooperation  layer  calls  upon  the  physical  layer,  which  uses  either 
hierarchical  relaying  or  multi-hop  communication.  0 


4.4  Auxiliary  Lemmas 

In  this  section,  we  provide  auxiliary  results,  which  will  be  used  several  times  in  the 
following.  These  results  are  grouped  into  three  parts.  In  Section  4.4.1,  we  describe 
regularity  properties  exhibited  with  high  probability  by  the  random  node  placement. 
In  Section  4.4.2,  we  provide  auxiliary  upper  bounds  on  the  performance  of  any  scheme 
in  terms  of  cut-set  bounds.  Finally,  in  Section  4.4.3,  we  describe  auxiliary  results  on 
the  performance  of  hierarchical  relaying  and  multi-hop  communication. 


4.4.1  Regularity  Lemmas 

Here  we  prove  several  regularity  properties  that  are  satisfied  with  high  probability 
by  a  random  node  placement.  Formally,  define  V(n)  to  be  the  collection  of  all  node 
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placements  V (n)  that  satisfy  the  following  conditions: 

ru,v  >  n_1  \/u,v  G  V(n), 

|  Ve,i(n)  |  —  l°g(n)  for  £  =  ^  log(n)  and  \/i  G  {1, ... ,  4£}, 

|V^(n)|  >1  for  i  =  ^  log  (21og/  J  and  Vi  e  ih-  *  • 

|Vi,i(n)|  G  4"£+1n]  W  G  jl, . . . ,  ^  log(n)(l  -  log"5/6(n))  j,  i  G  {1, . . . ,  Ae}. 

The  first  condition  is  that  the  minimum  separation  between  node  pairs  is  not  too 
small.  The  second  condition  is  that  all  squares  of  area  1  contain  at  most  log(n) 
nodes.  The  third  condition  is  that  all  squares  of  area  2  log(n)  contain  at  least  one 
node.  The  fourth  condition  is  that  all  squares  up  to  level  4log(n)(l  —  log  "5/6(n)) 
contain  a  number  of  nodes  proportional  to  their  area.  Note  that,  since 

L(n)  <  ^  log(n)  (l  -  log5/6(n)), 

this  holds  in  particular  for  nodes  up  to  level  L{n).  The  goal  of  this  section  is  to  prove 
that 

P (V(n)  G  V(n))  =  l-o(l), 

as  n  — >  oo. 

The  first  lemma  shows  that  the  minimum  separation  in  a  random  node  placement 
is  at  least  n_1  with  high  probability. 

Lemma  4.3. 

P(  min  ruv  >  n~l  1=1  —  o(l), 

V  u£V(n),v£V(n)\{u}  ’  / 

as  n  — >  oo. 

Proof.  For  u,  v  G  V,  let 

Bu,V  {V  U,V  — 
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for  some  r  (depending  only  on  n) .  Fix  a  node  u  £  V,  then 


r27r 


P(Buv\u)  <  , 

n 


(the  inequality  being  due  to  boundary  effects).  Moreover,  the  events  {BUjV}v£v\{u} 
are  independent  conditioned  on  u,  and  thus 


P(  rWUu}  B°tV\u)  =  P {BcuJu)  >  (l 


r27TN  n 


n 


From  this, 


P(  min  rUtV  <  r  )  =  P(  Uuey;„ey\{u}  Bu. 

-  U„6y\{u}  Bu,v 


uev 


'■oevvtu} 


\ 

u£V 

u£V 


E  !-P(n 

u£V  ' 

E  U  -  e(p(  n„ey\M  Bcuv\u 


—  nil  —  1 


r27r 


n 


which  converges  to  zero  for  r  =  n 


-1 


□ 


The  next  lemma  asserts  that  if  £{n)  is  not  too  large  then  all  squares  {V^j(n)}^ 
for  £  G  {1, . . . ,  l(n )}  and  i  G  {1, . . . ,  4£}  in  the  grid  decomposition  of  V ( n )  contain  a 
number  of  nodes  that  is  proportional  to  their  area. 


Lemma  4.4.  If  £(n)  satisfies 


lim 

n— >00 


£(n) 

4-i(n)n 


0 
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then 


l{ri)  4^ 

P(  nnil^WI  €  [4-'-1n,4-<+1n]})  =  1  -  o(l) 

^  £=  1  i=  1  ' 


as  n  — >  oo.  In  particular,  this  holds  for 


ifn)  =  -  log(n)  (l  -  log  5/6 (n)), 


and  for  £(n)  =  L{n). 


Proof.  Let  Bu  be  the  event  that  node  u  lies  in  for  fixed  £,  i.  Note  that 


u&V 


by  definition,  and  that 


Hence  using  the  Chernoff  bound 


P  (Bu)  =  4- 


[4"<-1n,4-<+1n]  ]  <  exp(-/l4^n), 


for  some  constant  K.  From  this,  we  obtain  for  I  =  f'(n), 


4«(n) 


P  H  {lV?(„),il  e  [4-^-^, A-W+'n]} 


i=  1 


4^(n) 


>  1  -  £P(IU,„),,I  *  [4-'<”>-1n,4-'(")+1n]) 


2=1 


>  1  -  4^(n)  exp(-/l 4"?(n)n) 

>  1  -  exp (K£(n)  -  K4~^n), 


for  some  constant  K .  By  assumption 


.  Kn) 

lim  — - - =  0, 


4~£(n) 


n 


(4.5) 
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and  hence 


4/0) 


W  f|  {!%),*!  e  [4-^)-1n,4-^)+1n]}  >  1  -  o(l), 


i=  1 


as  n  — >  00.  Since  the  {A^}^  are  nested  as  a  function  of  £,  we  have 

£(n)  4e  4h») 

fl  ntl^l  6  [4_f_1«,4-'+1n]}  =  fl  {l»W  6  4-'"M+1n]}, 

i=  1  i= 1  i=l 

which,  combined  with  (4.5),  proves  the  first  part  of  the  lemma. 

For  the  second  part,  note  that  for 


Kn)  =  2  los(n){1  ~  loS  5/6('«))5 


we  have 


£(n)  _  \  log(n)(l  -  log  5/6 (n)) 


4-i(n)n  2los1/6(n) 

log(n) 


< 


2log1/®(n) 

_  2loglog(n)-log1/6W  q 


and  hence  the  lemma  is  valid  in  this  case.  The  same  holds  for  £(n)  =  L{n )  since 


L(n)  <  -  log(n)  (l  -  log  5/6(n)). 


□ 


We  are  now  ready  to  prove  that  a  random  node  placement  V(n)  is  in  V(n)  with 
high  probability  as  n  oo  (i.e.,  is  fairly  “regular”  with  high  probability). 

Lemma  4.5. 

P (V(n)  E  V(n))  =  l-o(l), 

as  n  —4  oo. 
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Proof.  The  first  condition, 


rUjV  >  '«-_1  Vit,  v  G  V, 

holds  with  probability  1  —  o(l)  by  Lemma  4.3.  The  second  and  third  conditions, 

for  £  =  ^  log(n)  and  Vi  G  {1, ... ,  4£}, 

fo^  =  Ilog(2toiR)mdvie{1""'4'}' 
are  shown  in  [38,  Lemma  5.1]  to  hold  with  probability  1  —  o(l).  The  fourth  condition, 

\Ve,i\  e  [4~*_1n,4_<+1n]  W  G  |l, . . . ,  ^  log(n)(l  -  log“5/6(n))  |,  i  G  {1,...,4*}, 

holds  with  probability  1  —  o(l)  by  Lemma  4.4.  Together,  this  proves  the  desired 
result.  □ 

4.4.2  Converse  Lemmas 

Here  we  prove  several  auxiliary  converse  results.  The  first  lemma  bounds  the  maximal 
achievable  sum  rate  for  every  individual  node  (i.e.,  the  total  traffic  for  which  a  fixed 
node  is  either  source  or  destination). 

Lemma  4.6.  Under  either  fast  or  slow  fading,  for  any  a  >  2,  there  exists  b(;n)  = 
0(log(n))  such  that  for  all  V(n)  G  V[n),  Auc  G  Auc(n),  u  G  V(n), 

XuCv  <  Kn)> 

tier  (n)\{u} 

AtU,«  <  Kn)- 

v£V  (n)\{u} 

Proof.  The  argument  follows  the  one  in  [38,  Theorem  3.1].  For  any  Si,  S2  C  V,  denote 
by  C(S  1,  S2)  the  MIMO  capacity  between  nodes  in  Si  and  nodes  in  S2.  Consider  first 


(4.6) 

(4.7) 


\Ve,i\  <  log (n) 
|V*,i|  >  1 
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(4.6).  By  the  cut-set  bound  [9,  Theorem  14.10.1], 


V^U 

Here  C{{u},  (u}c)  is  the  SIMO  capacity  between  u  and  the  nodes  in  {w}c,  i.e., 

c(M,  Mc)  =  los  +  E^l^r) 

<  log(l  +  (n  —  1  )na) 

<  K  \og{n)  r 


with 

K  =  2  +  a, 


and  where  for  the  first  inequality  we  have  used  that  since  V  G  V,  we  have  rU)V  >  n 
for  all  u,v  G  V. 

Similarly,  for  (4.7), 


and 


C({u}c,  {«})  <  log  (l  +  {n  -  1)E^UK«|2) 

<  log(l  +  (n  —  1  )2na) 

<  K\og(n). 


□ 


The  next  two  lemmas  bounds  the  maximal  achievable  sum  rate  across  the  bound¬ 
ary  of  the  subsquares  V^n)  for  £  G  {1 , ,L(n)},  and  i  G  {1, . . . , 4£}. 

Lemma  4.7.  Under  either  fast  or  slow  fading,  for  any  a  >  2,  there  exists  b(n)  = 
0(log6(n))  such  that  for  all  V(n)  G  V(n),  Auc  G  Auc(n),  i  G  {1  ,...,L(n)},  and 
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i  e  {1, . . . ,  4 ej,  we  have 


E  E  A“»  ^  Kn)(4-'n)2-"h,{s-"l/2. 

ueVe^n)  v£Veti(n) 


Proof.  As  before,  denote  by  C(Si,S2 )  the  MIMO  capacity  between  nodes  in  S\  and 
nodes  in  S 2 ,  for  S'i,^  C  V.  By  the  cut-set  bound  [9,  Theorem  14.10.1], 


E  E  A“c„<c(U,„r,y.  (4.8) 

uev(ti 


Let 


H 


Si,S2 


=  [K,v. 


U,V}UE:Si,VGS2 


be  the  matrix  of  channel  gains  between  the  nodes  in  Si  and  S'2.  Under  fast  fading 


C(SUS2)=  max  E 

Q(H)>  0: 
E(i?u,u)<f>  v«eSi 


logdet  (l  +  HluS2Q(H)HSl,s2)\ 


and  under  slow  fading 


C(S,  Sc )  =  max  logdet  (J  +  iT/ iS2QiTSliS2). 

qu,u<P  V«e5i 

Denote  by  diVff)  the  nodes  in  that  are  within  distance  8 (a/2  log(n)  +  1)  of  the 
boundary  between  A\i  and  A^.  Using  the  generalized  Hadamard’s  inequality,  yields 
then  that  under  either  fast  or  slow  fading 


C(Vt 


»  V£i)  <  C(Ve,i,  d{V&)  +  C(Vt>i,  \  d{V&). 


(4.9) 


We  start  by  analyzing  the  first  term  in  the  sum  in  (4.9).  Applying  Hadamard’s 
inequality  again  yields 


C(Vw,3(V£))<  E 

ved(V£i) 
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Since  V  6  V,  we  have 


|<9(V^)|  <  40 log(n) ( y/\ 2 log(n)  +  1)(4  £n)1//2  <  1201og2(n)(4  ln)1^2 

and  by  the  same  analysis  as  in  Lemma  4.6,  we  have 

C(Ve,i,  M)  <  C({v}c,  M)  <  log(n) 

for  some  constant  K  (independent  of  v).  Therefore 

C{Veti,i diV^))  <  1201og(n)2(4“£n)1/2^log(n) 

<  K  log3(n)(4”£n)1//2.  (4.10) 


We  now  analyze  the  second  term  in  the  sum  in  (4.9).  Applying  Lemma  3.12, 


C{V^V^\d{V^)) 


<  4  max  <  1 


max  > 


Ec 


veV£i\d(vii) '  u,v 


«6V?i\s(yy1 


(4.11) 


We  now  show  that  the  term  in  parentheses  in  (4.11)  is  0(log3(n)).  Fix  u  G  V/  and 
denote  by  du  the  distance  of  u  to  the  closest  point  in  V/  \  d (V)CJ.  Note  that  be 
definition  of  <9(V)CJ,  we  have 


du  >  8 (a/2 log(n)  +  1). 

Since,  V  G  V,  there  are  at  least 

(%-4du(y/2  log(n)  +  1)  >  dl 

21og(n)  —  41og(n) 
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nodes  that  are  at  distance  at  most  3 du.  Hence 


E 


*evfti\d(Vfti) 


d2~a 

-a  >  - u - 

u’v  -  41+Q  log(n)  ’ 


and  therefore  for  any  v  G  \  d (V^), 


E 


^veAd(vli)  u’ 


r-<41+Qlog(n)^ 


<  41+"  log(n)r~ 


_2  /  1  U,V 


d.ti 


2— a 


<  41+Q  log(n)r, 


-2 
u,v ' 


(4.12) 


Using  he  V,  yields  that  for  any  v  G  V[i  \  d{V[i), 


uev*,,  ^®ey&\a(v&)  '  U,: 


—  <  4  +a  log  (to)  >  rUi„  <  4' +Q  ln(2)  log  (to). 


U€-V(>  i 


Combined  with  (4.11),  this  shows  that 


V  y  r- 


«evz  veV£( 


for  some  constant  K'  (independent  of  £,  and  i) . 


(4.13) 


Combining  (4.10)  and  (4.13)  with  (4.9),  we  obtain 


C(V,:i,Vey  <  K' log3(n)  y  y  r-;  +  A'log3(n)(4-'n)1/2.  (4.14) 

u£Vtti  veVfti\d(Vfti) 

Moreover,  using  again  the  same  arguments  as  in  [38,  Theorem  5.2]  shows  that  there 
exists  a  constant  K  such  that  for  adjacent  squares  (i.e.,  sharing  a  side)  Ag^Agj, 

<  A'log3(n)(4^n)2-min{3’Q}/2.  (4.15) 

«£h,i  vevtij\ d(VecA) 

Consider  now  two  diagonal  squares  (i.e.,  sharing  a  corner  point)  Agj,  Agj,  and  choose 


132 


i,  j  such  that  Afj  U  and  A^j  U  A^j  are  adjacent  rectangles.  Using  the  same 
arguments  to  these  rectangles  and  suitably  redefining  K  shows  that  (4.15)  holds  for 
diagonal  squares  as  well. 


Using  this,  we  now  compute  the  summation  in  (4.14).  Consider  “rings”  of  squares 
around  A^.  The  first  such  “ring”  contains  the  (at  most)  8  squares  neighboring  A^. 
The  next  “ring”  contains  at  most  16  squares.  In  general,  “ring”  k  contains  at  most 
8k  squares.  Let 

{Ae,j}jeik 

be  the  squares  in  “ring”  k.  Then 


I]  I]  ru*  =  ruav  (4-16) 

u£Vt,i  veVfti\d(vecti)  k> i  jeh  «eu,i  vevtJ\d (vee) 

By  (4.15), 

YJ2  r~av  <  8A'log3(n)(4-^n)2-mill{3’a}/2.  (4.17) 

jeh  u£Veti  veVt'jXdiyfj) 

Now  note  that  for  k  >  1  and  j  6  4,  nodes  u  G  by*  and  v  G  are  at  least  at  distance 
ru,v  >  (k  —  l)(2~e^/n).  Moreover,  since  V  G  V,  each  {I ^e,j}e,j  has  cardinality  at  most 
4 -£+1n.  Thus 

EEE  E 

k>i  jeik  ueVe^  veVjijXdiv^)  k> l 

=  128(4 -en)2~a/2  Y  k(k  ~  1)'" 

k>  1 

=  K"  (A~en)2~a/2 ,  (4.18) 

for  some  K"  >  0,  and  where  we  have  used  that  a  >  2.  Substituting  (4.17)  and  (4.18) 
into  (4.16)  yields 


Y  r~^  <8Klog%n)(A-en)2~min{3’a}/2 +  K,,(A~en)2  a/2.  (4.19) 

uev^  veVfti\d(Vfti) 
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Combining  (4.19)  with  (4.14)  and  (4.8)  shows  that 


V  v  A“c„<6(n)(4-'n)2-m“'|3-",/2. 

u&V^i  vgV(ti 

for  every  i  G  {1 , ,L(n)},i  G  {1, . . . , 4£},  and  under  either  fast  or  slow  fading.  □ 


Lemma  4.8.  Under  either  fast  or  slow  fading,  for  any  a  >  5,  there  exists  b(;n)  = 
0(log3(n))  such  that  for  all  V{n)  G  V{n),  Auc  G  Auc(n),  i  G  {1  ,...,L(n)},  and 
we  have 


V  5]  <  6(n)( 4-'n)‘/2. 

u^Ve:i{n)  veViti(n) 


Proof.  By  the  cut-set  bound  [9,  Theorem  14.10.1], 


E  (4-20) 

veVe'i 

Denote  by  dVf^  the  nodes  in  VE  that  are  within  distance  one  of  the  boundary  between 
Ar(  t  and  A^.  Applying  the  generalized  Hadamard  inequality  as  in  Lemma  4.7,  we 
have  under  either  fast  or  slow  fading 

C(V,%  vy  <  C(V,%  dVlti)  +  C(V,%  Vt,i  \  9V,ti) 

(4.21) 

<  K  log3(n)(4-,n)1''2  +  C(V,%  Vu  \  3U,<). 

for  some  constant  K . 

For  the  second  term  in  (4.21),  we  have  the  following  upper  bound  from  Theorem 
2.1  in  [21]: 

C(vt%  VU  \  evtt)  <  E  ( E  C/2)2- 

v€Viti\dViti  ueVfj 

Now,  consider  v  G  \  and  let  dv  be  the  distance  of  v  to  the  closest  node  in 
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VR.  Then,  using  he  V, 


E  rzT  <  K\og{n)dl~‘l\ 

U^ve,i 

for  some  constant  K,  and  hence  for  a  >  5, 

C(V,%  Vi,,  \  dvu )  <  Y  k 2  iog2(n)<-“ 

veVe,i\ 8Ve:i 

<  K'  log3(n)(4_^n)1//2 . 

Combined  with  (4.21),  this  proves  Lemma  4.8  for  traffic  from  V[i  to  VR.  □ 

4.4.3  Achievability  Lemmas 

In  this  section,  we  prove  auxiliary  achievability  results.  Recall  that  a  permutation 
traffic  is  a  traffic  pattern  in  which  each  node  is  source  and  destination  exactly  once. 
Call  the  corresponding  source-destination  pairing  II  C  V(n)  x  V(n)  a  permutation 
pairing.  The  lemma  below  analyzes  the  performance  achievable  with  either  hier¬ 
archical  relaying  (for  a  G  (2,3])  or  multi- hop  communication  (for  a  >  3)  applied 
simultaneously  to  transmit  permutation  traffic  in  several  disjoint  regions  in  the  net¬ 
work. 

Lemma  4.9.  Under  fast  fading,  for  any  a  >  2,  there  exists  b(n)  >  n~°^  such  that 
for  all  V(n)  €  V(n),  f  6  (0, ...  ,L(n)},  i  G  {1, . . .  ,  4£},  and  all  permutation  source- 
destination  pairings  IR  on  VR(n),  there  exists  Auc  G  Auc(n)  such  that 

min  min  >  6(n)(4^n)1-min{3’“}/2. 

>\  <i . i'  i  (u,ti)enj 

The  same  statement  holds  with  probability  1  —  o(l)  as  n  — >  oo  in  the  slow  fading  case. 

Consider  the  source-destination  pairing  II  =  UjlR  with  IR  as  in  Lemma  4.9.  This 
is  a  permutation  pairing,  since  each  IR  is  a  permutation  pairing  on  VR(n)  and  since 
the  {VR(n)}j  are  disjoint.  Lemma  4.9  states  that  all  source-destination  pairs  in  II 
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can  communicate  at  a  per-node  rate  of  at  least  Note  that, 

due  to  the  locality  of  the  traffic  pattern,  this  can  be  considerably  better  than  the 
ni-mm{3,a}/2-o(i)  per_nof]e  rate  achieved  by  standard  hierarchical  relaying  or  multi¬ 
hop  communication. 


Proof.  We  shall  use  either  hierarchical  relaying  (for  a  G  (2,3])  or  multi-hop  (for 
a  >  3)  to  communicate  within  each  square  V^.  We  operate  every  fourth  of  the  Vy* 
simultaneously,  and  show  that  the  added  interference  due  to  this  spatial  re-use  results 
only  in  a  constant  factor  loss  in  rate. 

Consider  first  a  G  (2,3]  and  fast  fading.  The  squares  at  level  t  have  an  area 


We  will  use  hierarchical  relaying  within  each  of  the  Applying  Theorem  3.1) 

with  <5  =  1/4,  it  is  sufficient  to  show  that  we  can  partition  each  Agj  into 


l+log  1/4(a^) 
Up 


subsquares,  each  of  which  contains  a  number  of  nodes  proportional  to  the  area  (see 
Section  3.10.4).  In  other  words,  we  partition  A  into  subsquares  of  size 


_ 1 _  log  1^4(rt) 

l  +  log-1/4^)  ^  l  +  log_1/4(n) 

“  HA 

>  log-1/3(n) 

-  aL(n) 

=  nlog-5/6(n) 

>  n4_log^n^1_log_5/6^"^. 


Since  V  G  V,  all  these  subsquares  contain  a  number  of  nodes  proportional  to  their 
area,  and  hence  this  shows  that  all 


{A/} 


fe{0,...,L(n)},ie{l,...,4«} 
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are  simultaneously  regular  enough  for  hierarchical  relaying  to  be  successful  under  fast 
fading.  By  Theorem  3.1,  this  achieves  a  per-node  rate  of 


\"cv  >n"o(1)(4-‘n) 1-0/2 


(4.22) 


for  any  (u,v)  G  Ilj. 

We  now  show  that  (4.22)  holds  with  high  probability  also  under  slow  fading.  By 
Theorem  3.1,  for  V  G  V  hierarchical  relaying  is  successful  under  slow  fading  for  all 
permutation  traffic  on  V  with  probability  at  least 

1  -  exp  (  -  2Klog3/») 

for  some  constant  K.  Hence,  hierarchical  relaying  is  successful  for  all  permutation 
traffic  on  with  probability  at  least 

1  -  exp  (  -  2A'log3/4(4'^))  >  1  -  exp  (  -  2Klo^4-ZMn^ 

=  1  -  exp  (^_2xlog3/8(n)). 

And  hence  hierarchical  relaying  is  successful  under  slow  fading  for  all  £  G  {1, . . . ,  L(n)} 
and  all  permutation  traffic  on  every  {W,i}f=1  with  probability  at  least 

1  -  L(n)4Z(">exp  (  -  2JrlosS,»)  >  1  -  n2exp  (  -  2,f '«’'») 

>l-o(l) 


as  n  — »  oo. 

We  now  argue  that  the  additional  interference  from  spatial  re-use  results  only  in  a 
constant  loss  in  rate.  This  follows  from  the  same  arguments  as  in  the  proof  of  Theorem 
3.1  (with  the  appropriate  modifications  for  slow  fading  as  described  there).  Intuitively, 
this  is  the  case  since  the  interference  from  a  square  at  distance  r  is  attenuated  by  a 
factor  r~a,  which,  since  a  >  2,  is  summable.  Hence  the  combined  interference  has 
power  on  the  order  of  the  receiver  noise,  resulting  in  only  a  constant  factor  loss  in 
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rate. 


For  a  >  3,  the  argument  is  similar  —  instead  of  hierarchical  relaying  we  now 
use  multi-hop  communication.  For  V  G  V  and  under  either  fast  or  slow  fading,  this 
achieves  a  per-node  rate  of 


>  n-°m(4,-‘n Q1/2 


(4.23) 


for  any  (u,  v)  G  IF,;.  Combining  (4.22)  and  (4.23)  yields  the  desired  result. 


□ 


4.5  Proof  of  Theorem  4.1 

The  proof  of  Theorem  4.1  relies  on  the  construction  of  a  capacitated  (noiseless,  wire- 
line)  graph  G  and  linking  its  performance  under  routing  to  the  performance  of  the 
wireless  network.  This  graph  G  =  (Vq,  Eq)  is  constructed  as  follows.  G  is  a  full  tree 
(i.e.,  all  its  leaf  nodes  are  on  the  same  level).  G  has  n  leaves,  each  of  them  represent¬ 
ing  an  element  of  V(n).  To  simplify  notation,  we  assume  that  V(n)  C  Vq,  so  that 
the  leaves  of  G  are  exactly  the  elements  of  V ( n )  C  Vq.  Whenever  the  distinction  is 
relevant,  we  use  u,v  for  nodes  in  V(n)  C  Vq  and  //,  v  for  nodes  in  Vq  \  V(n)  in  the 
following.  The  internal  nodes  of  G  correspond  to  V^i(n)  for  all  £  G  {0, . . . ,  L(n)}, 
i  G  {1, . . .  ,4r},  with  hierarchy  induced  by  the  one  on  A{n).  In  particular,  let  /i  and 
v  be  internal  nodes  in  Vq  and  let  V^(n)  and  Vg+i j(n)  be  the  corresponding  subsets 
of  V(n).  Then  v  is  a  child  node  of  //  if  W+i j(n)  C  Ve,i(n). 

In  the  following,  we  will  assume  V (■ n )  G  V(n),  which  holds  with  probability  1— o(l) 
as  n  OG  by  Lemma  4.5.  With  this  assumption,  nodes  in  Vq  at  level  £  <  L(n )  have 
4  children  each,  nodes  in  Vq  at  level  £  =  L(n )  have  between  4 ~LG)~1n  and  4~L(jl)+1n 
children,  and  nodes  in  Vq  at  level  £  =  L(n)  +  1  are  the  leaves  of  the  tree  (see  Figure 
4-5  below  and  Figure  4-3  in  Section  4.3). 

For  n  G  Vq,  denote  by  £(/i)  the  leaf  nodes  of  the  subtree  of  G  rooted  at  /i.  Note 
that,  by  construction  of  the  graph  G ,  £(/i)  =  Vt^n)  for  some  £  and  i.  To  understand 
the  relation  between  Vq  and  V(n),  we  define  the  representative  7Z  :  Vq  2v(~n)  Qf  ^ 
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1  =  0 

1  =  1 

l  =  L(n) 

l  =  L(n) 

Figure  4-5:  Communication  graph  G  constructed  in  the  proof  of  Theorem  4.1.  Nodes 
on  levels  £  G  {0, . . . ,  L(n)  —  1}  have  four  children  each,  nodes  on  level  £  =  L(n)  have 
0  (nlog  1  2G)'j  children  each.  The  total  number  of  leaf  nodes  is  n,  one  representing  each 
node  in  the  wireless  network  V(n).  An  internal  node  in  G  at  level  £  G  {0, . . . ,  L(n)} 
represents  the  collection  of  nodes  in  V^*(n)  for  some  i. 

as  follows.  For  a  leaf  node  u  E  V (n)  C  VG  of  G,  let 

K(u)  =  {u}. 

For  n  E  VG  at  level  L(n),  choose  7 Z(fj,)  C  £(/x)  C  V(n)  such  that 

\K{n)\  =4-£(n)“1n. 

This  is  possible  since  V(n)  G  V(n)  by  assumption.  Finally,  for  /x  G  VG  at  level 
£  <  L(n),  and  with  children  {;y  }j=1,  let 

4 

n»)  =  IJ^W- 

i= i 

We  now  define  an  edge  capacity  for  each  edge  (/x,  i/)  G  EG.  If  /z  is  a  leaf  of  G 
and  i/  its  parent,  set 

=  c„iM  =  1.  (4.24) 

If  /x  is  an  internal  node  at  level  £  in  G  and  zx  its  parent,  then  set 

C*„  =  4  (4-^)2-min{3,a}/2_  (4.25) 
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Having  chosen  edge  capacities  on  G,  we  can  now  define  the  set  A^.c(n)  C  M”xn  of 
feasible  unicast  traffic  matrices  between  leaf  nodes  of  G.  In  other  words,  Auc  G  A^.c(n) 
if  messages  at  the  leaf  nodes  of  G  can  be  routed  to  their  destinations  (which  are  also 
leaf  nodes)  over  G  at  rates  Auc  while  respecting  the  capacity  constraints  on  the  edges 
of  G. 

We  first  prove  the  achievability  part  of  Theorem  4.1.  The  next  lemma  shows  that 
if  traffic  can  be  routed  over  the  tree  G,  then  approximately  the  same  traffic  can  be 
transmitted  reliably  over  the  wireless  network. 

Lemma  4.10.  Under  fast  fading,  for  any  a  >  2  ,  there  exists  b(n )  >  n~°^  such  that 
for  any  V(n)  G  V(n), 


b(n)AQC(n)  C  Auc(n). 

The  same  statement  holds  for  slow  fading  with  probability  1  —  o(l)  as  n  — >  oo. 

Proof.  Assume  Auc  G  AgC,  i.e.,  traffic  can  be  routed  between  the  leaf  nodes  of  G  at 
a  rate  Auc,  we  need  to  show  that  n_°^1^Auc  G  Auc  (i.e.,  almost  the  same  flow  can  be 
reliably  transmitted  over  the  wireless  network) .  We  use  the  three- layer  communication 
architecture  introduced  in  Section  4.3  to  establish  this  result. 

Recall  the  three  layers  of  this  architecture:  the  routing,  cooperation,  and  physical 
layers.  The  layers  of  this  communication  scheme  operate  as  follows.  In  the  routing 
layer,  we  treat  the  wireless  network  as  the  graph  G  and  route  the  messages  over  the 
edges  of  G.  The  cooperation  layer  provides  this  tree  abstraction  to  the  routing  layer 
by  distributing  and  concentrating  messages  over  subsets  of  the  wireless  network.  The 
physical  layer  implements  this  distribution  and  concentration  of  messages  by  dealing 
with  interference  and  noise. 

Consider  first  the  routing  layer,  and  assume  that  the  tree  abstraction  G  can  be 
implemented  in  the  wireless  network  with  only  a  n~° ^  factor  loss.  Since  Auc  G  AgC 
by  assumption,  we  then  know  that  the  routing  layer  will  be  able  to  reliably  transmit 
messages  at  rates  n_°^1^Auc.  We  now  show  that  the  tree  abstraction  can  indeed  be 
implemented  in  the  wireless  network. 
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This  tree  abstraction  is  provided  to  the  routing  layer  by  the  cooperation  layer.  We 
will  show  that  the  operation  of  the  cooperation  layer  satisfies  the  following  invariance 
property:  If  a  message  is  located  at  a  node  /x  G  G  in  the  routing  layer,  then  the  same 
message  is  evenly  distributed  over  all  nodes  in  7 Z(n)  in  the  wireless  network.  In  other 
words,  all  nodes  u  G  7Z(n)  C  V  have  access  to  a  distinct  part  of  length  of 

the  message. 

Consider  first  a  leaf  node  u  G  V  C  Vq  in  G,  and  assume  the  routing  layer  calls 
upon  the  cooperation  layer  to  send  a  message  to  its  parent  v  G  Vq  in  G.  Note  first 
that  u  is  also  an  element  of  V,  and  it  has  access  to  the  entire  message  to  be  sent 
over  G.  Since  for  leaf  nodes  7 Z(u)  =  {«},  this  shows  that  the  invariance  property  is 
satisfied  at  u.  The  message  is  split  at  u  into  \TZ(u)  \  parts  of  equal  length,  and  one  part 
is  sent  to  each  node  in  7 Z(u)  over  the  wireless  network.  In  other  words,  we  distribute 
the  message  over  the  wireless  network  by  a  factor  of  \H(y)\.  Hence  the  invariance 
property  is  also  satisfied  at  /i. 

Consider  now  an  internal  node  fi  G  Vq,  and  assume  the  routing  layer  calls  upon 
the  cooperation  layer  to  send  a  message  to  its  parent  node  v  G  Vq.  Note  that  since 
all  traffic  in  G  originates  at  the  leaf  nodes  of  G  (which  are  the  actual  nodes  in  the 
wireless  network),  a  message  at  //  had  to  traverse  all  levels  below  /j  in  the  tree  G.  We 
assume  that  the  invariance  property  holds  up  to  level  /i  in  the  tree,  and  show  that 
it  is  then  also  satisfied  at  level  v.  By  the  induction  hypothesis,  each  node  u  G  7 Z(fi) 
has  access  to  a  distinct  part  of  length  l/\TZ(fj,)\  of  the  message.  Each  such  node  u 
splits  its  message  part  into  four  distinct  parts  of  equal  length.  Node  u  keeps  one  part 
for  itself,  and  sends  the  other  three  parts  to  nodes  in  7 Z(u).  Since  \1Z{v)  \  =  4|7£(/i)|, 
this  can  be  performed  such  that  each  node  in  7 Z(u)  obtains  exactly  one  message  part. 
In  other  words,  we  distribute  the  message  by  a  factor  four  over  the  wireless  network, 
and  the  invariance  property  is  satisfied  at  v  G  Vq- 

Operation  along  edges  down  the  tree  (i.e.,  towards  the  leaf  nodes)  is  similar,  but 
instead  of  distributing  messages,  we  now  concentrate  them  over  the  wireless  network. 
To  route  a  message  from  a  node  /i  G  Vq  with  internal  children  { }j=  x  to  one  of  them 
(say  U\ )  in  the  routing  layer,  the  cooperation  layer  sends  the  message  parts  from  each 
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{'R'{vj)}j=2  to  a  corresponding  node  in  1Z{u\ )  and  combines  them  there.  In  other 
words,  we  concentrate  the  message  by  a  factor  four  over  the  wireless  network. 

To  route  a  message  to  a  leaf  node  u  G  V  C  Vq  from  its  parent  v  in  G  in  the 
routing  layer,  the  cooperation  layer  sends  the  corresponding  message  parts  at  each 
node  7 Z(u)  to  u  over  the  wireless  network.  Thus,  again  we  concentrate  the  message 
over  the  network,  but  this  time  by  a  factor  of  \lZ(v)\.  Both  these  operations  along 
edges  down  the  tree  preserve  the  invariance  property.  This  shows  that  the  invariance 
property  is  preserved  by  all  operations  induced  by  the  routing  layer  in  the  cooperation 
layer. 

Finally,  to  actually  implement  this  distribution  and  concentration  of  messages, 
the  cooperation  layer  calls  upon  the  physical  layer.  Note  that  at  the  routing  layer,  all 
edges  of  the  tree  can  be  routed  over  simultaneously.  Therefore,  the  cooperation  layer 
can  potentially  call  the  physical  layer  to  perform  distribution  and  concentration  of 
messages  over  all  sets  simultaneously.  The  function  of  the  physical  layer 

is  to  schedule  all  these  operations  and  to  deal  with  the  resulting  interference  as  well 
as  with  channel  noise. 


This  scheduling  is  done  as  follows.  First,  the  physical  layer  time  shares  between 
communication  up  the  tree  and  communication  down  the  tree  (i.e.,  between  distri¬ 
bution  and  concentration  of  messages).  This  results  in  a  loss  of  a  factor  1/2  in  rate. 
The  physical  layer  further  time  shares  between  all  the  L(n)  +  1  internal  levels  of  the 
tree,  resulting  in  a  further  1)+1  factor  loss  in  rate.  Hence,  the  total  rate  loss  by  this 
time  sharing  is 


1 

2(Z(n)  +  1) ' 


(4.26) 


Consider  now  the  operations  within  some  level  t  G  1, , . . ,  L{ri)  in  the  tree  (i.e.,  for 
edge  (p,  z/)  on  this  level,  neither  p  nor  v  is  a  leaf  node).  We  show  that  the  rate  at 
which  the  physical  layer  implements  the  edge  (p,  u)  is  equal  to  n~o(^c^v^  i.e.,  only  a 
n°^ 1 !  factor  less  than  the  capacity  of  the  edge  (p,  u)  in  the  tree  G.  Note  first  that  the 
distribution  or  concentration  of  traffic  induced  by  the  cooperation  layer  to  implement 
one  edge  e  at  level  l  is  restricted  to  W-i,i  for  some  %  =  %{e).  We  can  thus  partition 
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the  edges  at  level  l  into  {Eg}‘j=1  such  that  the  four  sets 

U 

e£-EG 

of  nodes  are  disjoint.  Time  sharing  between  these  four  sets  yields  an  additional  loss 
of  a  factor  1/4  in  rate.  Fix  one  such  value  of  j,  and  consider  the  operations  induced 
by  the  cooperation  layer  in  the  set  corresponding  to  j.  We  consider  communication 
up  the  tree  (i.e.,  distribution  of  messages),  the  analysis  for  communication  down  the 
tree  is  similar.  For  a  particular  edge  (//,  u)  G  EJG  with  v  the  parent  of  //,  each  node 
u  G  7 Z(n)  has  split  its  message  part  into  four  parts,  three  of  which  need  to  be  sent  to 
other  nodes  in  7 Z(v).  Moreover,  this  assignment  of  destination  nodes  in  7 Z(u)  to  u  is 
performed  such  that  no  node  in  7 Z(v)  is  destination  more  than  once.  In  other  words, 
each  node  in  7 Z(n)  is  source  exactly  three  times  and  each  node  in  7 Z(u)  is  destination 
at  most  once.  This  can  be  written  as  three  source-destination  pairings  {fl/^  ^}|=1,  on 
Ve-i,i(n,v)-  Moreover,  each  such  II v ^  can  be  understood  as  a  subset  of  a  permutation 
source-destination  pairing.  We  time  share  between  the  three  values  of  k  (yielding  an 
additional  loss  of  a  factor  1/3  in  rate).  Now,  for  each  value  of  k,  Lemma  4.9  shows 
that  by  using  either  hierarchical  relaying  (for  a  G  (2,  3])  or  multi-hop  communication 
for  (a  >  3),  we  can  communicate  according  to  {II^}eeE^  at  a  per- node  rate  of 

n-°(1)  (4— 17j,)1— min{3>a}/2 

under  fast  fading,  and  with  probability  1  —  o(l)  also  under  slow  fading.3  Since  7 Z(fi) 
contains  4~l~ln  nodes,  and  accounting  for  the  loss  (4.26)  for  time  sharing  between 
the  levels  in  G  and  the  additional  loss  of  factors  1/4  and  1/3  for  time  sharing  between 
j  and  k ,  the  physical  layer  implements  an  edge  capacity  for  e  at  level  £  of 

~  1 _  .1.1. 4 -^“1n  .  n-°(1)('4-t-ln')l-min{3,«}/2 

2(L(n)  +  l)  4  3 

3Note  that  Lemma  4.9  actually  shows  that  all  permutation  traffic  for  every  value  of  1  can  be 
transmitted  with  high  probability  under  slow  fading.  In  other  words,  with  high  probability  all  levels 
of  G  can  be  implemented  successfully  under  slow  fading. 
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=  n-«(i)(4-^n)2-mhl{3’"}/2  =  n-o(1)c, 


Consider  now  the  operations  within  level  i  =  L{ri)  +  1  in  the  tree  (i.e. ,  for  edge 
(u,  v)  on  this  level,  u  is  a  leaf  node).  We  show  that  the  rate  at  which  the  physi¬ 
cal  layer  implements  the  edge  (w,  u)  is  equal  to  n~°^cUiU.  We  again  consider  only 
communication  up  the  tree  (i.e.,  distribution  of  messages  in  the  cooperation  layer), 
communication  down  the  tree  is  performed  in  a  similar  manner.  The  traffic  induced 
by  the  cooperation  layer  at  level  L(n)  + 1  is  within  the  sets  Vi(n)  i  f°r  *  =  {1,  -  -  - ,  4L<n^}. 
Consider  now  communication  within  one  Vz(n)  y  and  assume  without  loss  of  generality 
that  in  the  routing  layer  every  node  u  G  i  needs  to  send  traffic  along  the  edge 
(u,u).  In  the  physical  layer,  we  need  to  distribute  a  l/\lZ{y)\  fraction  of  this  traffic 
from  each  node  u  G  to  every  node  in  7 Z(u)  C  t-  This  can  be  expressed 

as  source-destination  pairings,  and  we  time  share  between  them.  Accounting 

for  the  fact  that  only  l/\R,(y)\  of  traffic  needs  to  be  sent  according  to  each  pairing 
and  since  V  G  V,  this  results  in  a  time  sharing  loss  of  at  most  a  factor 

gMj  ,  i 

Now,  using  Lemma  4.9,  all  these  source-destination  pairings  in  all  subsquares  j} 

can  be  implemented  simultaneously  at  a  per  node  rate  of 

n-o(l)(4-Zn)l— min{3, a}/2  >  n~o(l)  ^log-1'2 (n)yl/2  >  „-»(!)_ 


Accounting  for  the  loss  (4.26)  for  time  sharing  between  the  levels  in  G,  the  additional 
factor  1/16  loss  for  time  sharing  within  each  Vjjn)  the  physical  layer  implements  an 
edge  capacity  for  e  at  level  i  =  L{n)  +  1  of 


1 

2  (L(n)  +  1) 


—  •  n“o(1)  =  n“o(1)  =  n-°(1)c, 
16 


under  either  fast  or  slow  fading. 

Together,  this  shows  that  the  physical  and  cooperation  layers  provide  the  tree 
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abstraction  G  to  the  routing  layer  with  edge  capacities  of  only  a  factor  n~°^  loss. 
Hence,  if  messages  can  be  routed  at  rates  Auc  between  the  leaf  nodes  of  G ,  then 
messages  can  be  reliably  transmitted  over  the  wireless  network  at  rates  n-0WAuc. 
Hence 

Auc  G  Augc  =»  Auc  e  Auc, 

and  noting  that  the  n~°^  factor  is  uniform  in  Auc,  this  shows  that 

n-°^AuGc  C  Auc. 


□ 

We  have  seen  that  the  unicast  capacity  region  A^c(n)  of  the  graph  G  under  routing 
is  (appropriately  scaled)  an  inner  bound  to  the  unicast  capacity  region  Auc(n)  of  the 
wireless  network.  The  next  lemma  shows  that  A^c(n)  is  equal  to  the  approximate 
unicast  capacity  region  A^c(n)  of  the  wireless  network  as  defined  in  (4.2).  Combining 
Lemmas  4.5,  4.10,  and  4.11  below,  yields  that  with  high  probability 

n-o(1)Auc(n)  n-o(1)AUC(n)  (-  Auc(n), 
proving  the  achievability  part  of  Theorem  4.1. 

Lemma  4.11.  For  any  a  >  2  and  any  V(n)  G  V(n), 

AiC(n)  =  A^c(n). 

Proof.  We  first  relate  the  total  traffic  across  an  edge  e  in  the  graph  G  to  the  total 
traffic  across  a  cut  V()%  for  some  £  and  i. 

Consider  an  edge  e  =  (/x,  u)  G  EG:  and  assume  first  that  e  connects  a  node  at 
level  l  +  1  and  l  in  the  tree  with  £  <  L(n).  We  slight  abuse  of  notation,  set 

A 
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Note  first  that  by  (4.25)  we  have 


■e  =  (4— «n)2-mm{3,a}/2_ 


(4.27) 


Moreover,  since  G  is  a  tree,  removing  the  edge  e  from  Eq  separates  the  tree  into  two 
connected  components,  say  Si,S2  C  Vq.  Consider  now  the  leaf  nodes  in  S\.  By  the 
construction  of  the  tree  structure  of  G,  these  leaf  nodes  are  either  equal  to  or  V[{ 
for  some  i  G  {1, . . . ,  4£}.  Assume  without  loss  of  generality  that  they  are  equal  to  V^. 
Then  are  the  leaf  nodes  in  .5V  Now  since  traffic  is  only  assumed  to  be  between 
leaf  nodes  of  G,  the  total  traffic  demand  between  Si  and  S2  is  equal  to 


y:  (4.28) 

By  the  tree  structure  of  G,  all  this  traffic  has  to  be  routed  over  edge  e. 

Consider  now  an  edge  e  connecting  a  node  at  level  L(n )  +  1  and  L(n),  i.e.,  a  leaf 
node  u  to  its  parent  v.  Then,  by  (4.24), 

ce  =  1,  (4.29) 

and  the  total  traffic  crossing  the  edge  e  is  equal  to 

y~l(-^,w  +  -^w)-  (4.30) 

W^U 


We  now  show  that 


Assume  Auc  G  A^c,  then 


A? 


uc  c  A"c. 


(4.31) 


V  5]  (Aj£  +  A“CJ  < 

u€Veti  W^VE 
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for  all  i  G  {1, . . . ,  L(n)},  i  G  {1, . . . ,  4£},  and 

+  A^CJ  <  1 

W^U 

for  all  u  G  V.  By  (4.27),  (4.28),  (4.29),  (4.30),  this  implies  that  the  traffic  demand 
across  each  edge  e  of  the  graph  G  is  less  than  its  capacity  ce.  Since  G  is  a  tree,  this 
implies  that  Auc  can  be  routed  over  G,  i.e.,  Auc  G  AqC.  This  proves  (4.31). 

We  now  show  that 

A^.c  C  A^c.  (4.32) 

Assume  Auc  G  A^b  This  implies  Auc  can  be  routed  over  G ,  and  hence  the  total 
flow  across  each  edge  is  less  than  the  capacity  of  that  edge.  By  (4.27),  (4.28),  (4.29), 
(4.30),  this  implies  Auc  G  A^0,  from  which  (4.32)  follows.  □ 

We  now  turn  to  the  converse  part  of  Theorem  4.1.  The  next  lemma  shows  that 
A2  c (n)  (appropriately  scaled)  is  an  outer  bound  to  the  unicast  capacity  region  Auc(n) 
of  the  wireless  network.  Combined  with  Lemma  4.5,  this  shows  that  with  high  prob¬ 
ability 

Auc(n)  C  0(log6(n))A^c(n), 
proving  the  converse  part  of  Theorem  4.1. 

Lemma  4.12.  Urider  either  fast  or  slow  fading,  for  any  a  >  2,  there  exists  b{n )  = 
0(log6(n))  such  that  for  any  V(n)  G  V(n), 

Auc(n)  C  5(n)Auc(n). 

Proof.  Assume  Auc  G  Auc.  By  Lemma  4.7,  we  have  for  any  i  G  {1, . . .  ,L(n)}  and 
i  e 

T  £  <  inog'wm'n)2-™"13^72,  (4.33) 

ueVC'i  veVfa 

for  some  constant  K  not  depending  on  Auc.  Moreover,  Lemma  4.6  shows  that  for  all 
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£(A“C„  +  A„uy  <  2inog(Ii).  (4.34) 

V^U 

Combining  (4.33)  and  (4.34)  proves  that  there  exists  b(n)  =  0(log6(n))  such  that 
Auc  G  Auc  implies  Auc  G  b[n) A^0,  proving  the  lemma.  □ 


4.6  Proof  of  Theorem  4.2 

Assume  Auc  G  Auc.  By  Lemmas  4.7  and  4.8,  we  have  for  any  i  G  {1, . . . ,  L(n)}, 
i  G  {1, . . . ,  4£j,  and  a  >  5, 


max 


^u,v>  ^ju£Veci  Ve:i  A“c„  }  <  inog6(n)(4-'n) 


2— min{3,o;}/2 


for  some  constant  K  not  depending  on  Auc.  Hence 


£  £  (A«,+XS)  <  2A'log6(n)(4-'n)2-”"'l3'»l/2. 

ueviti  veV[ti 

Combined  with  the  outer  bound  in  Theorem  4.1,  this  shows  that  there  exists  b(n)  = 
0(log6(n))  such  that  for  a  >  5,  Auc  G  Auc  implies  Auc  G  b(n) A^0,  proving  Theorem 
4.2. 


4.7  Discussion 

We  discuss  several  aspects  and  extensions  of  the  three-layer  architecture  introduced 
in  Section  4.3  and  used  to  show  achievability  in  Theorem  4.1.  In  Section  4.7.1, 
we  comment  on  the  various  tree  structures  used  in  the  three-layer  architecture.  In 
Section  4.7.2,  we  show  that  for  certain  values  of  a  the  bounds  in  Theorem  4.1  can  be 
significantly  sharpened.  In  Section  4.7.3,  we  point  out  how  the  results  discussed  so 
far  can  be  used  to  obtain  the  scaling  of  the  unicast  capacity  region  of  dense  networks 
(where  n  nodes  are  randomly  placed  on  a  square  of  unit  area).  Section  4.7.4  contains 
design  guidelines  that  can  be  obtained  from  the  scaling  results  for  the  unicast  capacity 
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region  presented  in  this  chapter. 


4.7.1  Tree  Structures 

There  are  two  distinct  tree  structures  that  are  used  in  the  construction  of  the  three- 
layer  communication  scheme  proposed  in  this  chapter  —  one  explicit  and  one  implicit. 
These  two  tree  structures  appear  in  different  layers  of  the  communication  scheme  and 
serve  different  purposes. 

The  first  (explicit)  tree  structure  is  given  by  the  tree  G  utilized  in  the  routing  layer 
and  implemented  in  the  cooperation  layer.  The  main  purpose  of  this  tree  structure 
is  to  perform  localized  load  balancing.  In  fact,  the  distribution  and  concentration  of 
traffic  is  used  to  avoid  unnecessary  bottlenecks.  Note  that  the  tree  G  is  used  by  the 
scheme  for  any  value  of  a. 

The  second  (implicit)  tree  structure  occurs  in  the  physical  layer.  This  tree  struc¬ 
ture  appears  only  for  values  of  a  G  (2,  3],  when  the  physical  layer  operates  using  the 
hierarchical  relaying  scheme  (see  Chapter  3).  It  is  this  hierarchical  structure  of  this 
scheme  that  can  equivalently  be  understood  as  a  tree.  The  purpose  of  this  second  tree 
structure  is  to  enable  multiple  antenna  communication,  i.e.,  to  perform  cooperative 
communication. 


4.7.2  Second-Order  Asymptotics 

The  scaling  result  in  Theorems  4.1  and  4.2  are  up  to  a  factor  n±0 ^  and  hence  preserve 
information  at  scale  rC  for  constant  (3  (see  also  the  discussion  in  Section  4.1.2).  Here 
we  take  a  closer  look  at  the  behavior  of  this  n±o(T  factor,  and  show  that  in  certain 
situations  it  can  be  significantly  sharpened. 

Note  first  that  the  outer  bound  in  Theorems  4.1  and  4.2  hold  up  to  a  factor 
log6(n),  i.e.,  poly- logarithmic  in  n.  However,  the  inner  bound  holds  only  up  to  the 
aforementioned  n~°^  factor.  A  closer  look  at  the  proof  of  the  theorems  reveals  that 
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the  precise  inner  bound  is  of  order 


n~°{  log  1/4(n)) 

and  with  a  more  careful  analysis  (see  Chapter  3  for  the  details),  this  can  be  sharpened 
to  essentially 

n~°{  iog-1/2bd) 


The  exponent  log_1^2(n)  in  the  inner  bound  has  two  causes.  The  first  is  the  use 
of  hierarchical  relaying  (for  a  G  (2,3]).  The  second  is  the  operation  of  the  physical 
layer  at  level  L(n)  +  1  of  the  tree  (i.e.,  to  implement  communication  between  the  leaf 
nodes  of  G  and  their  parents).  Indeed  at  that  level,  we  are  operating  on  a  square  of 
area 

4 -£(n)n  =  nIogV2(n), 

and  the  loss  is  essentially  inversely  proportional  to  that  area.  Now,  the  reason  why 
L(n)  cannot  be  chosen  to  be  larger  (to  make  this  loss  smaller),  is  because  hierarchical 
relaying  requires  a  certain  amount  of  regularity  in  the  node  placement,  which  can 
only  be  guaranteed  for  large  enough  areas. 


This  suggests  that  for  the  a  >  3  regime,  where  multi-hop  communication  is  used 
at  the  physical  layer  instead  of  hierarchical  relaying,  we  might  be  able  to  significantly 
improve  the  inner  bound.  To  this  end,  we  have  to  choose  more  levels  in  the  tree  G, 
such  that  at  the  last  level  before  the  tree  nodes,  we  are  operating  on  a  square  that 
has  an  area  of  order  log(n).  Changing  the  three- layer  architecture  in  this  manner,  for 
a  >  3  the  inner  bound  can  be  improved  to  be  poly-logarithmic  in  n  as  well.  Combined 
with  the  poly-logarithmic  outer  bound,  this  yields  a  poly-logarithmic  approximation 
for  n 2  —  n  out  of  n2  total  dimensions  of  the  unicast  capacity  region  A uc(n)  for  a  G 
(3,5],  and  a  poly- logarithmic  approximation  for  the  entire  unicast  capacity  region 
Auc(n)  for  a  >  5. 
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4.7.3  Dense  Networks 


So  far,  we  have  only  discussed  extended  networks,  i.e.,  n  nodes  are  located  on  a  square 
of  area  n.  We  now  briefly  sketch  how  these  results  can  be  recast  for  dense  networks, 
in  which  n  nodes  are  located  on  a  square  of  unit  area.  It  suffices  to  notice  that  by 
rescaling  power  by  a  factor  n~a /2,  a  dense  network  can  essentially  be  transformed 
into  an  equivalent  extended  network  with  path- loss  exponent  a  (see  also  [38]).  Hence 
the  scaling  of  the  unicast  capacity  region  for  dense  networks  can  be  obtained  from 
the  scaling  result  for  extended  networks  by  taking  a  limit  as  a  — >  2. 

The  resulting  approximate  unicast  capacity  region  A^c(n)  has  a  particularly  sim¬ 
ple  shape  in  this  limit.  In  fact,  the  only  constraints  in  (4.2)  that  can  be  tight  are  at 
level  i  =  log(n).  This  results  in  the  following  approximate  unicast  capacity  region  for 
dense  networks: 

A“c(n)  4  |auc  e  KJX"  :  £>«,$  +  Atuc„)  <  1,  V  u  e  V(n)}, 

and  we  obtain  that  for  dense  networks,  for  any  a  >  2, 

n_°(1)A)JC(n)  C  Auc(n)  C  0(log6(n))A)JC(n). 

Note  that,  in  contrast  to  the  extended  case,  this  results  in  an  approximate  character¬ 
ization  of  the  entire  unicast  region  for  all  a  >  2. 

4.7.4  Design  Guidelines 

The  results  presented  in  this  chapter  suggest  the  following  design  guidelines  for  the 
construction  and  operation  of  large  wireless  networks.  First,  it  shows  that  load  balanc¬ 
ing  is  crucial  for  optimal  operation.  This  is  implemented  in  the  proposed  three-layer 
architecture  by  spreading  traffic  over  clusters  of  neighbors,  but  could  be  achieved  in 
other  ways  as  well.  A  communication  protocol  not  using  such  load  balancing  could 
create  bottlenecks  that  prevent  it  from  operating  optimally.  It  is  worth  pointing  out 
that  in  the  case  of  random  source-destination  pairing  with  uniform  rate  no  such  load 
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balancing  is  necessary  since  the  traffic  demand  itself  is  already  balanced. 

Second,  the  load  balancing  has  to  be  done  in  a  localized  manner.  In  the  three-layer 
architecture,  this  is  achieved  by  choosing  the  cluster  over  which  the  load  balancing  is 
performed  to  be  those  nodes  that  are  in  the  same  smallest  square  V^j(n)  that  contains 
both  the  source  and  its  destination.  Again,  other  ways  to  perform  this  load  balancing 
are  possible,  however  for  optimal  operation  this  load  balancing  must  have  the  same 
localized  structure. 

Finally,  recall  that  for  random  node  placement  and  random  source-destination 
pairing  with  uniform  rate,  hierarchical  cooperation  is  order  optimal  for  a  G  (2,3], 
and  multi-hop  communication  is  order  optimal  for  a  >  3.  The  same  threshold  phe¬ 
nomenon  occurs  also  for  random  node  placement  with  heterogeneous  traffic.  In  the 
three-layer  architecture,  this  is  visible  in  the  physical  layer,  which  uses  hierarchical 
relaying  for  a  G  (2, 3]  and  multi-hop  communication  for  a  >  3.  In  other  words, 
for  optimal  transmission  of  heterogeneous  traffic  over  randomly  placed  nodes,  global 
cooperation  is  necessary  for  small  path-loss  exponents,  and  local  cooperation  is  suf¬ 
ficient  for  large  path-loss  exponents.  Note  that  the  assumption  of  randomly  placed 
nodes  is  crucial  for  this  conclusion  to  be  valid,  as  was  discussed  in  detail  in  Section  3. 


4.8  Chapter  Summary 

In  this  chapter,  we  have  obtained  information-theoretic  inner  and  outer  bounds  on 
the  n2-dimensional  unicast  capacity  region  of  a  wireless  network  with  n  randomly 
placed  nodes  and  assuming  a  Gaussian  fading  channel  model.  These  bounds  are 
tight  (in  the  scaling  sense)  along  n2  —  n  of  the  total  n2  dimensions  for  a  G  (2,  5] 
(corresponding  to  balanced  traffic),  and  along  all  n 2  dimensions  for  a  >  5.  This 
approximate  characterization  is  in  terms  of  2 n  weighted  cuts,  which  are  based  on  the 
geometry  of  the  locations  of  the  source  nodes  and  their  destination  nodes  and  on  the 
traffic  demands  between  them,  and  thus  can  be  readily  evaluated. 

This  characterization  is  obtained  by  establishing  that,  for  balanced  traffic  or  for 
a  >  5,  the  unicast  capacity  region  of  a  capacitated  tree  graph  under  routing  has  the 
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same  scaling  as  the  unicast  capacity  region  of  the  original  wireless  network.  The  leaf 
nodes  of  this  tree  graph  correspond  to  the  nodes  in  the  wireless  network,  and  internal 
nodes  of  the  tree  graph  correspond  to  hierarchically  growing  sets  of  nodes. 

This  equivalence  suggests  a  three-layer  communication  architecture  for  achieving 
the  entire  unicast  capacity  region  (in  the  scaling  sense).  The  top  or  routing  layer 
establishes  paths  from  each  of  the  source  nodes  to  its  destination  over  the  tree  graph. 
The  middle  or  cooperation  layer  provides  this  tree  abstraction  to  the  routing  layer 
by  distributing  the  traffic  among  the  corresponding  subset  of  nodes  as  a  message 
travels  up  the  tree  graph,  and  by  concentrating  the  traffic  on  to  the  corresponding 
subset  of  nodes  as  the  message  travels  down  the  tree.  The  bottom  or  physical  layer 
implements  this  distribution  and  concentration  of  traffic  over  the  wireless  network. 
The  implementation  of  this  distribution  and  concentration  of  traffic  depends  on  the 
path-loss  exponent:  For  low  path-loss  exponent  (a  €  (2,3]),  hierarchical  relaying  is 
used,  while  for  high  path-loss  exponent  (cc  >  3),  multi- hop  communication  is  used. 
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Chapter  5 


Service  Heterogeneity:  Multicast 


In  this  chapter,  we  analyze  the  scaling  of  the  n  x  2" -dimensional  multicast  capacity 
region  AMC(n)  of  a  wireless  network  of  n  randomly  placed  nodes  under  a  Gaussian 
fading  channel  model. 

We  first  present  an  inner  and  an  outer  bound  to  the  n  x  2n-dimensional  multicast 
capacity  region.  We  show  that  the  two  bounds  coincide  (up  to  scaling)  along  n2n  — 
n  dimensions  (corresponding  to  balanced  traffic)  for  a  G  (2,  5]  and  along  all  n2n 
dimensions  for  a  >  5.  We  show  that,  as  the  unicast  capacity  region,  the  multicast 
capacity  region  can  be  approximated  by  a  polytope  with  less  than  2 n  faces,  each 
corresponding  to  a  distinct  cut  (i.e.,  a  subset  of  nodes)  in  the  wireless  network.  Again, 
as  in  the  unicast  case,  only  2 n  out  of  2”  possible  cuts  in  the  wireless  network  are 
asymptotically  relevant. 

Second,  we  show  how  the  three-layer  communication  architecture  introduced  in 
Chapter  4  can  be  adapted  for  the  transmission  of  multicast  traffic.  Recall  the  three 
layers  of  this  scheme:  The  routing  layer,  the  cooperation  layer,  and  the  physical  layer. 
We  show  that  only  the  routing  layer  needs  to  be  changed  to  accommodate  multicast 
traffic.  The  other  two  layers  are  unaffected.  The  approximate  optimality  of  this 
three-layer  architecture  implies  that  a  separation  based  approach,  in  which  routing  is 
performed  independently  of  the  physical  layer,  is  order-optimal  for  balanced  multicast 
traffic  or  for  a  >  5,  and  hence  techniques  such  as  network  coding  are  not  necessary 
for  order-optimal  transmission  of  multicast  traffic  in  wireless  networks. 
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5.0.1  Organization 

The  remainder  of  this  chapter  is  organized  as  follows.  Section  5.1  presents  the  main 
results  of  this  chapter.  In  Section  5.2,  we  analyze  various  examples  scenarios  with 
heterogeneous  multicast  traffic  patterns  for  which  no  scaling  results  were  previously 
known.  Section  5.3  provides  a  high-level  description  of  the  modifications  to  the  three- 
layer  architecture  required  for  multicast  traffic.  Section  5.4  contains  the  proofs  of 
the  main  results.  Finally,  Sections  5.5  and  5.6  contain  discussions  and  concluding 
remarks. 


5.1  Main  Results 

This  section  contains  the  main  results  of  this  chapter.  In  Section  5.1.1,  we  provide 
inner  and  outer  bounds  on  the  multicast  capacity  region  AMC(n)  and  conditions  for 
these  bounds  to  be  tight  in  the  scaling  sense.  In  Section  5.1.2,  we  discuss  implications 
of  these  results  on  the  behavior  of  the  multicast  capacity  region  for  large  values  of  n. 
We  consider  computational  aspects  in  Section  5.1.3. 


5.1.1  Multicast  Capacity  Region 

Recall  the  definition  of  L(n )  in  (4.1)  and  of  the  subsets  {W,i(^)}y=i  of  V(n)  at  level 
t  in  Section  4.1.1.  Define 


A rw  =  AML  e 


A  I  \  MC  tu n x 2n  . 


E  E  AS  +  E  E  AS  <  (4 


ueVe  i(n)  WcV(n): 

wrvf^n)^ 


ueWfn)  WcV(n): 


Wnve4n)^<6 

V£e  {!,..., £(»)}, ie  {!,... 4'}, 


E  «-+E  E 

u^u  Wcv(n): 
u£W 

Vw  e  R(n)|, 


W  CV  (n): 
W\{u}^ 


(5.1) 
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and 


A^c(n)  =  i  Amc  g  IT*2" 


E  E  «-s(4 


-*n\2-min{3,a}/2 


ueVe  i(ri)  WcV(n): 

wnvecti(n)ji<l 


W  G  {1, . . . ,  L(n)},i  G  {1, . . .  4*}, 


Y  +  Yj  Y,  < 1 

u^u 

Vw  G  V(n) 


WcV(n): 

W\{u}^ 


u^u  Wcv(n): 
u€W 


The  definitions  of  Aj/ic(n)  and  A^/lc(n)  are  similar  to  those  of  A^c(n)  and  A2C(n) 
in  Chapter  4.  A^c(n)  and  A^c(n)  are  the  collection  of  all  multicast  traffic  matrices 
AMC  such  that  for  various  cuts  S  C  V(n)  in  the  network,  the  total  traffic  demand  (in 
either  one  or  both  directions) 


E\  ^  \  MC 

/  ,  Au,W 

u£S  WcV(n): 

wrsc^% 

/  ,  \w  +  2-, 

«SS  WcV(n):  u&Sc  WcV(n): 

wnsc^0  mns^0 

across  the  cut  S  is  not  too  big.  Note  that,  unlike  in  the  definitions  of  A)JC(n)  and 
A2C(n),  we  count  \u,w  as  crossing  the  cut  S  (in  the  outgoing  direction,  say)  if  u  E  S 
and  WnSc  ±  0,  i.e. ,  if  there  is  at  least  one  node  w  in  the  multicast  destination  group 
W  that  lies  outside  S.  The  number  of  such  cuts  S  we  need  to  consider  is  at  most  2 n, 
as  in  the  unicast  case. 

The  next  theorem  shows  that  Aj/ic(n)  is  an  approximate  (in  the  scaling  sense) 
inner  bound  and  A^c(n)  is  an  approximate  outer  bound  to  the  multicast  capacity 
region  AMC(n)  of  the  wireless  network. 
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Theorem  5.1.  Under  either  fast  or  slow  fading,  for  any  a  >  2,  there  exist 


bi(n )  >  n 

b2(n)  =  0(log6(n)), 


such  that 

6i(n)Aj/lc(n)  C  A MC(n)  C  b2(n)A™c(n), 
with  probability  1  —  o(l)  as  n  — >  oo. 

Theorem  5.1  holds  only  with  probability  1  —  o(l)  for  different  reasons  for  the 
fast  and  slow  fading  cases.  Under  fast  fading,  the  theorem  holds  for  almost  all  node 
placements.  Under  slow  fading,  the  theorem  holds  under  the  same  conditions  on  the 
node  placement,  but  now  it  also  only  holds  for  almost  all  realization  of  the  fading 

\@u,v  /  u,v 

Comparing  the  expressions  for  Aj/ic(n)  and  A^c(n),  we  see  that  whenever  a  traffic 
matrix  AMC  satisfies 


V  A«  +  X  X  <n°V  v  A*  (5.2) 

W<ZV{n):  uGVf^n)  WcV(n):  WcV(n): 

wnvp.{n)^%  ’  ivny,i(n)^  wnvp/n)^® 

for  all  £  E  {1, . . . ,  L(n)},  i  E  {1, . . .  4^}  then  AMC  G  A^/,c(n)  implies  n-°^AMC  G 
Aj/ic(n),  and  hence,  for  such  traffic  matrices,  the  inner  and  outer  bound  in  Theorem 
5.1  coincide  up  to  scaling.  In  particular,  this  applies  for  multicast  traffic  matrices 
AMC  such  that  (5.2)  holds  with  equality.  We  call  such  traffic  (approximately)  balanced 
in  the  following.  Note  that  the  the  condition  of  balanced  multicast  traffic  imposes  (at 
most)  n  linear  constraints  on  AMC,  and  hence  the  inner  and  outer  bound  in  Theorem 
5.1  are  tight  up  to  scaling  along  at  least  n2n  —  n  out  of  all  n 2n  total  dimensions  of 
AMC(n). 

For  high  path- loss  exponent  (a  >  5),  the  next  theorem  shows  that  the  inner  bound 
Aj/ic(n)  in  Theorem  5.1  is  also  an  approximate  outer  bound  to  AMC(n). 
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Figure  5-1:  For  a  >  5,  the  set  A^c(n)  approximates  the  multicast  capacity  region 
AMC(n)  of  the  wireless  network  in  the  sense  that  6i(n)Aj/lc(n)  (with  bi(n)  >  n-0^) 
provides  an  inner  bound  to  AMC(n)  and  63(n)Aj/lc(n)  (with  b3(n)  =  0(log6  («))) 
provides  an  outer  bound  to  AMC(n).  The  figure  shows  two  dimensions  (namely  Aw"2  3| 
and  Ag{Ci,3})  of  the  n  x  2”-dimensional  set  AMC(n).  The  same  approximation  result 
holds  for  a  G  (2,  5]  along  at  least  n2n  —  n  out  of  n 2n  dimensions. 


Theorem  5.2.  Under  either  fast  or  slow  fading,  for  any  a  >  5,  there  exists 


b3(n)  =  0(log6(n)), 


such  that 

A MC(n)  C  b3(n)A™c(n), 
with  probability  1  —  o(l)  as  n  — >  oo. 


Theorems  5.1  and  5.2  imply  that  the  quantity  Aj/ic(n)  determines  the  scaling  of  the 
multicast  capacity  region  AMC(n)  for  a  >  5  and  along  all  dimensions  corresponding 
to  balanced  traffic  for  a  G  (2,  5].  This  is  illustrated  in  Figure  5-1.  The  approximation 
is  within  a  factor  n±0^\  which,  as  in  the  unicast  case,  can  be  further  sharpened  (see 
Section  4.7.2  in  Chapter  4  for  a  discussion). 
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5.1.2  Implications  of  Theorems  5.1  and  5.2 

As  in  the  unicast  case,  Theorems  5.1  and  5.2  can  be  applied  in  two  ways.  First, 
the  theorem  can  be  used  to  analyze  the  asymptotic  achievability  of  a  sequence  of 
multicast  traffic  matrices.  Let  {AMC(n)}n>i  be  such  a  sequence  of  multicast  traffic 
matrices  with  AMC(n)  G  M™*2".  Define 

p*XMc(n)  =  sup{p  :  pAMC(n)  G  AMC(n)}, 
p*XMc{n)  =  sup{p  :  pAMC(n)  G  AMC(n)}, 


in  analogy  to  the  unicast  case.  Theorems  5.1  and  5.2  state  that  if  either  a  >  5  or  all 
AMC(n)  are  balanced  then1 


lim 

n— >oo 


log(n) 


lim 

n— >oo 


log(PAMc(^)) 

log(n) 


Several  applications  of  this  approach  are  explored  in  Section  5.2. 

Second,  Theorems  5.1  and  5.2  provide  information  about  the  shape  of  the  multicast 
capacity  region  AMC(n).  As  in  the  unicast  case,  the  boundary  points  of  AMC(n)  vary 
at  least  from  n-min(3>«}/2+0(1)  to  n~°^\  and  this  variation  on  exponential  scale  is 
preserved  by  A^c(n)  and  A^c(n).  See  also  Section  4.1.2  for  more  details. 


5.1.3  Computational  Aspects 

In  this  section,  we  show  that  Aj/ic(n)  can  be  efficiently  described.  By  Theorems  5.1 
and  5.2  this  yields  a  computationally  efficient  approximate  description  of  the  entire 
multicast  capacity  region  AMC(n)  for  a  >  5  and  of  n2n  —  n  of  its  n2n  dimensions  for 
a  G  (2,5], 

The  multicast  capacity  region  AMC(n)  is  a  nx  2”-dimensional  set,  i.e.,  the  number 
of  dimensions  is  exponentially  large  in  n.  Nevertheless,  its  approximation  Aj/ic(n) 
can  (as  in  the  unicast  case)  be  computed  by  evaluating  at  most  2 n  cuts.  This  yields 

1We  again  assume  that  the  limits  exist,  otherwise  the  same  statement  holds  for  lim  sup  and 
lim  inf. 
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a  very  compact  approximate  representation  of  the  multicast  capacity  region  AMC(n) 
(i.e.,  we  represent  a  region  of  exponential  size  in  n  as  an  intersection  of  only  linearly 
many  halfspaces  —  one  halfspace  corresponding  to  each  cut).  Moreover,  it  implies 
that  membership  AMC  G  A^c(n)  can  be  computed  efficiently.  More  precisely,  evalu¬ 
ating  each  of  the  0 (n)  cuts  takes  at  most  | {(u,W)  :  A^jy  >  0} |  operations.  Thus 
membership  AMC  G  Ajyic(n)  can  be  tested  in  at  most  O(n)  times  more  operations  than 
required  to  just  read  the  problem  parameters.  In  other  words,  we  have  a  linear  time 
(in  the  length  of  the  input)  algorithm  for  testing  membership  of  a  multicast  traffic 
matrix  AMC  in  Aj/ic(n),  and  hence  for  a  >  5  or  balanced  AMC  also  for  approximate 
testing  of  membership  in  AMC(n).  However,  this  algorithm  is  not  necessarily  linear 
time  in  n  since  reading  just  the  input  AMC  G  M™*2"  itself  might  take  exponential  time 
in  n. 


5.2  Example  Scenarios 

Here  we  consider  several  multicast  scenarios.  The  first  two  examples  consider  broad¬ 
cast  traffic.  The  third  and  fourth  examples  consider  proper  multicast  traffic  (i.e.,  only 
a  subset  of  nodes  acts  as  destinations). 

Example  5.1.  Broadcast  from  one  source 

Assume  we  have  only  one  source  (say  uq  G  V(n))  that  wants  to  broadcast  the 
same  message  to  all  other  nodes.  In  other  words,  we  consider  the  multicast  traffic 
matrix 

mc  I  p(n)  if  u  =  uq  and  W  =  V ( n ), 

=  ^ 

I  0  else, 

for  some  p{n)  >  0.  Applying  Theorem  5.1  yields  that  p*(n ),  the  largest  achievable 
p(n),  satisfies 

p*(n)  =  n±0^ 

with  probability  1  —  o(l)  as  n  — >  oo.  This  implies  that  the  source  can  broadcast  its 
information  at  essentially  constant  rate  independent  of  n  to  all  nodes  in  the  network. 
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Hence,  broadcasting  information  from  one  source  to  all  nodes  in  the  network  is  (at 
least  asymptotically)  no  harder  than  transmitting  information  from  one  source  to  its 
destination.  0 

Example  5.2.  Broadcast  from  many  sources 

Consider  a  scenario  with  rC  sources,  {ui, . . . ,  unp),  for  some  0  <  (3  <  1,  each 
broadcasting  an  independent  message  to  all  other  nodes  at  the  same  rate.  In  other 
words,  we  have  a  multicast  traffic  matrix  of  the  form 

MC  I  p(n)  if  u  —  Ui  for  some  i  and  W  =  V(n), 

P u,W  =  \ 

I  0  else, 

for  some  p(n)  >  0.  Applying  Theorem  5.1  yields  that  p*(n),  the  largest  achievable 
p(n),  satisfies 

p*(n)  =  n_/3±o(1), 

with  probability  1  —  o(l)  as  n  — >  oo.  0 

Example  5.3.  Multicast  from  many  sources 

Assume  each  node  generates  independent  multicast  traffic  for  a  set  of  nP  destina¬ 
tions.  All  these  sources  and  their  destinations  are  chosen  independently  and  uniformly 
at  random  from  V(n).  Let  {ui, . . . ,  un}  be  the  source  nodes  and  {Wi, . . . ,  Wn  with 
\Wi\  =  n13  the  corresponding  multicast  nodes.  Then  the  multicast  traffic  matrix  AMC 
is  of  the  form 

MC  I  P(n )  if  u  —  W  =  Wi  for  some  i, 

^u,w  =  ^  (5-3) 

I  0  else, 

for  some  p(n)  >  0.  This  traffic  matrix  is  approximately  balanced  with  high  probabil¬ 
ity,  and  hence  applying  Theorem  5.1  yields  that  p*(n)  satishes 

p*(n)  =  min  {n±0(1) ,  n(1-/3)(2-“/2)-i±°(i) }  (5.4) 
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with  probability  1  —  o(l)  as  n  — >  oo,  and  with 

a  =  min{3,  a}. 

Note  that  for  fa  =  0,  we  recover  the  result  for  unicast  traffic  with  random  source- 
destination  pairing  and  uniform  rate.  0 

Example  5.4.  Localized  multicast  from  many  sources 

Consider  the  setup  of  Example  5.3  above,  except  now  each  source  picks  rP1  des- 
tinations  uniformly  at  random  from  among  nodes  within  a  distance  of  n  2  ,  where 
fa  >  fa.  In  other  words,  each  source  node  performs  localized  multicast.  Again,  let 
{«!, . . . ,  unj  denote  the  source  nodes  and  {Wi, . . . ,  Wn},  with  \Wt\  —  rP1 ,  denote  the 
corresponding  destination  nodes,  where  now  rUi;V  <  n  2  ,  for  each  v  E  Wi.  The  multi¬ 
cast  traffic  matrix  AMC  is  of  the  form  (5.3).  This  traffic  matrix  is  again  approximately 
balanced  with  high  probability,  and  an  application  of  Theorem  5.1  shows  that  p*{n ) 
satisfies 


p*(n)  =  min{n±o(1),n(/32_/3l)(2““/2)_/32}±o(1)} 

with  probability  1  —  o(l)  as  n  — >  00.  Note  that  setting  fa  =  1,  i.e.,  each  source  picks 
its  destinations  uniformly  over  the  entire  region  A(n),  yields  the  same  scaling  of  p*{n) 
as  in  Example  5.3.  Similarly,  for  fa  =  0,  and  fa  =  fa  we  recover  the  unicast  scenario 
with  K  =  1  in  Example  4.1.  0 

5.3  Communication  Scheme  for  Multicast  Traffic 

Here  we  show  that  the  same  communication  scheme  presented  in  Section  4.3  for 
general  unicast  traffic  can  also  be  used  to  transmit  general  multicast  traffic.  It  is  the 
tree  structure  of  the  scheme  that  is  critically  exploited  in  the  proof  of  Theorem  5.1 
to  obtain  an  approximation  of  the  multicast  capacity  region  AMC(n). 

We  will  use  the  same  three-layer  architecture  as  for  unicast  traffic  presented  in 
Section  4.3.  Note  that  for  this,  we  only  need  to  modify  the  operation  of  the  top  or 
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Figure  5-2:  Example  operation  of  the  routing  layer  in  the  three-layer  architecture 
under  multicast  traffic. 

routing  layer.  Indeed,  no  matter  how  we  configure  the  routing  layer,  the  lower  layers 
operate  as  before. 

We  now  outline  how  the  routing  layer  needs  to  be  adapted  for  the  multicast 
case.  Consider  a  multicast  message  that  needs  to  be  transmitted  from  a  source  node 
u  €  V(n)  to  its  set  of  intended  destinations  W  C  V(n).  In  the  routing  layer,  we 
want  to  route  this  message  from  u  to  W  over  G.  Since  G  is  a  tree,  the  routing  part 
is  simple.  In  fact,  between  u  and  every  w  G  W  there  exists  a  unique  path  in  G. 
Consider  the  union  of  all  those  paths.  It  is  easy  to  see  that  this  union  is  a  subtree  of 
G.  Indeed,  it  is  the  smallest  subtree  of  G  that  covers  {u}  U  W.  Traffic  is  optimally 
routed  over  G  from  u  to  W  by  sending  it  along  the  edges  of  this  subtree. 

The  next  example  illustrates  the  operation  of  the  routing  layer  under  multicast 
traffic. 

Example  5.5.  Consider  one  source  node  u  and  the  corresponding  multicast  group 
W  =  {wi,  W2,  w3}  as  shown  in  Figure  5-2. 

In  the  routing  layer,  we  find  the  smallest  subgraph  G({u}  U  W)  covering  {u}  U  W 
(indicated  by  black  lines  in  Figure  5-2).  Messages  are  sent  from  the  source  to  its 
destinations  by  routing  them  along  this  subgraph.  In  other  words,  G({u}  U  W)  is  the 
multicast  tree  along  which  the  message  is  sent  from  u  to  W .  The  cooperation  layer 
and  physical  layer  operate  in  the  same  way  as  for  unicast  traffic  (see  Figure  4-4  for 
an  example).  0 
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5.4  Proofs 


This  Section  contains  the  proofs  of  Theorem  5.1  (in  Section  5.4.1)  and  of  Theorem 
5.2  (in  Section  5.4.2). 

5.4.1  Proof  of  Theorem  5.1 

The  proof  of  Theorem  5.1  relies  on  linking  the  multicast  capacity  region  to  the  unicast 
capacity  region.  We  say  that  a  unicast  traffic  matrix  Auc  is  compatible  with  a  multicast 
traffic  matrix  AMC  if  there  exists  a  mapping  /  :  V(n)  x  2V (n)  — >  V(n)  such  that 
f(u,  W)  E  W,  for  all  (u,  W),  and 

\UC  _  \MC 

Au,v  ~  Au,W 

WcV(n): 

f(u,W)=v 

for  all  (u,  v).  In  words,  AMC  is  compatible  with  Auc  if  we  can  create  the  unicast  traffic 
matrix  Auc  from  AMC  by  simply  discarding  the  traffic  for  the  pair  (u,  W)  at  all  the 
nodes  W\{f(u,  W)}.  Let  T(AMC)  be  the  set  of  all  unicast  traffic  matrices  compatible 
with  the  multicast  traffic  matrix  AMC.  We  extend  the  definition  of  T  to  sets  of  traffic 
matrices  as  follows: 


T(AMC(n))  =  |J  T(Amc), 

AMCeAMC(n) 

T-^A^n))  =  {AMC  :  T(Amc)  C  Auc(n)}. 

In  words,  T  (AMC(n))  is  the  collection  of  all  unicast  traffic  matrices  that  are  compatible 
with  a  multicast  traffic  in  AMC(n),  and  T~1(A  uc(n))  is  the  collection  of  all  multicast 
traffic  matrices  that  have  all  their  compatible  unicast  traffic  matrices  in  Auc(n). 

We  start  with  some  auxiliary  lemmas  for  the  outer  bound  in  Theorem  5.1. 

Lemma  5.3.  Under  either  fast  or  slow  fading,  for  any  a  >  2, 

AMC(n)  C  T-^A^n)). 
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Proof.  It  is  clear  that  if  AMC  G  AMC  then  Auc  €  Auc  for  any  Auc  G  r(AMC).  Indeed, 
we  can  reliably  transmit  at  rate  Auc  by  using  the  communication  scheme  for  AMC  and 
simply  discarding  the  messages  delivered  by  this  scheme  at  all  but  one  node  in  each 
multicast  destination  group.  Thus  AMC  G  AMC  implies  T(AMC)  C  Auc,  and  therefore 

t(amc)  =  |J  t(amc)cauc, 

AMCgAMC 

or,  equivalently, 

amc  c  r-i(Auc), 

proving  the  lemma.  □ 

We  now  prove  several  auxiliary  results  for  the  inner  bound  in  Theorem  5.1.  Con¬ 
sider  again  the  tree  graph  G  =  (Vq,Eq)  with  leaf  nodes  V(n)  C  Vq  constructed  in 
Section  4.5.  As  before,  we  consider  traffic  between  leaf  nodes  of  G.  In  particular, 
any  multicast  traffic  matrix  AMC  G  M+x2"  for  the  wireless  network  is  also  a  multicast 
traffic  matrix  for  the  graph  G.  Denote  by  A^c(n)  C  M+x2"  the  set  of  feasible  (under 
routing)  multicast  traffic  matrices  between  leaf  nodes  of  G. 

The  next  lemma  shows  that  if  multicast  traffic  can  be  routed  over  G  then  ap¬ 
proximately  the  same  multicast  traffic  can  be  transmitted  reliably  over  the  wireless 
network.  Before  we  state  that  lemma,  recall  the  definition  of  the  set  V(n)  of  node 
placements  V(n)  that  satisfy  certain  regularity  conditions  as  defined  in  Section  4.4.1. 

Lemma  5.4.  Under  fast  fading,  for  any  a  >  2,  there  exists  b(n)  >  n such  that 
for  all  V(n)  G  V(n), 


6(n)A^c(n)  C  AMC(n). 

The  same  statement  holds  under  slow  fading  with  probability  1  —  o(l)  as  n  — >  oo. 

Proof.  The  proof  follows  using  the  same  construction  as  in  Lemma  4.10.  □ 

We  now  show  that  since  G  is  a  tree  graph,  the  multicast  capacity  region  A^c(n) 
can  be  described  through  the  unicast  capacity  region  A)ic(n)  of  G.  The  fact  that  G 
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is  a  tree  is  critical  for  this  result  to  hold. 


Lemma  5.5.  For  any  a  >  2, 

r-W(n))  C  A^c(n). 

Proof.  Assume  that  AMC  ^  A^c.  Since  G  is  a  tree,  there  is  only  one  way  to  route 
multicast  traffic  from  u  to  W,  namely  along  the  subtree  G({w}  U  W)  induced  by 
{u}  U  W  (i.e.j  the  smallest  subtree  of  G  that  covers  {«}  U  W).  Hence  for  any  edge 
e  G  Eq,  the  traffic  dx Mc(e)  that  needs  to  be  routed  over  e  is  equal  to 

dAMC(e)  =  ^2  ^uyv- 

uGV,W  GV : 
e^E0({u}UW) 

Now,  since  AMC  f  A^c,  there  exists  e  G  Eg  such  that 

d\ Mc(e)  >  ce. 

But  then,  by  definition  of  d\ Mc(e),  there  exists  a  function  /  :  V  x  2V  — >  V  with 
f(u,  W)  G  W  for  all  (u,  W),  and  such  that 

d\ufc(\MC)(e)  —  dXMc(e)  >  ce, 

where  Ayc(AMC)  is  the  unicast  traffic  matrix  resulting  from  applying  /  to  the  multicast 
traffic  matrix  AMC.  Thus  Ayc(AMC)  A^,c. 

We  have  shown  that  if  AMC  f  A^c  then  there  exists  Auc  G  T(AMC)  such  that  Auc  f 
AgC.  In  other  words,  T(AMC)  is  not  a  subset  of  A^,c,  and  therefore  AMC  ^  r_1(Ag,c). 
Hence  AMC  ^  A^c  implies  AMC  ^  r_1(Ag,c),  and  thus 

r_1(AGC)  c  a^c. 

□ 
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We  are  now  ready  for  the  proof  of  Theorem  5.1.  Note  that 


T(Amc)  C  A yc(n)  ^  AMC  e  \yc(n), 

and  hence 

r‘(Afw)  =  A*w, 

for  j  e  {1,2}. 


(5.5) 


For  the  inner  bound  in  Theorem  4.1,  we  have  for  V  E  V 

6i(n)A^c(n)  =  &i(n)r_1  (A^c(n)) 
=  6i(n)r_1(A^c(n)) 

C  6i(n)AcC(n) 

C  A MC(n) 


(5.6a) 

(5.6b) 

(5.6c) 

(5.6d) 


for  b\(n)  >  n  0(1\  and  where  (5.6a)  follows  from  (5.5),  (5.6b)  follows  from  Lemma 
4.11,  (5.6c)  follows  from  Lemma  5.5,  and  (5.6d)  follows  from  Lemma  5.4. 


For  the  outer  bound  in  Theorem  4.1,  we  have  for  V  G  V 

AMC(n)  C  r_1(Auc(n))  (5.7a) 

C  62(n)r-1(A2uc(n))  (5.7b) 

=  b2(;n)A™c(;n)  (5.7c) 

for  &2(n)  =  0(log6(n)),  and  where  (5.7a)  follows  from  Lemma  5.3,  (5.7b)  follows  from 
Theorem  4.1,  and  (5.7c)  follows  from  (5.5). 

Since  V  G  V  with  probability  1  —  o(l)  by  Lemma  4.5,  this  proves  Theorem  5.1. 
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5.4.2  Proof  of  Theorem  5.2 


For  V  E  V,  and  a  >  5, 


AMC(n)  C  r-1(Auc(n))  (5.8a) 

C  63(n)r-1^AjJC(n))  (5.8b) 

=  b3(n)A^c(n)  (5.8c) 

for  b2(n)  =  O (log6 (n)) ,  and  where  (5.8a)  follows  from  Lemma  5.3,  (5.8b)  follows  from 
Theorem  4.2,  and  (5.8c)  follows  from  (5.5).  Since  V  £  V  with  probability  1  —  o(l)  by 
Lemma  4.5,  this  proves  Theorem  5.2. 


5.5  Discussion 

We  discuss  several  aspects  and  extensions  of  the  modified  three-layer  architecture 
introduced  in  Section  5.3  for  transmission  of  multicast  traffic.  In  Section  5.5.1,  we 
show  how  the  results  for  multicast  traffic  in  extended  networks  can  be  used  to  obtain 
scaling  results  for  multicast  traffic  in  dense  networks.  In  Section  5.5.2,  we  discuss 
design  guidelines  for  the  transmission  of  multicast  traffic  in  large  wireless  networks. 

5.5.1  Dense  Networks 

Up  to  this  point,  we  have  only  considered  multicast  traffic  in  extended  networks,  i.e., 
n  nodes  are  located  on  a  square  of  area  n.  As  in  previous  chapters,  we  now  briefly 
sketch  how  these  results  can  be  recast  for  dense  networks,  in  which  n  nodes  are  located 
on  a  square  of  unit  area.  As  in  the  unicast  case,  the  scaling  of  the  multicast  capacity 
region  for  dense  networks  can  be  obtained  from  the  one  for  extended  networks  by 
taking  a  limit  as  a  — >  2. 

The  resulting  approximate  multicast  capacity  regions  AMC(n)  has  again  a  partic¬ 
ularly  simple  shape  in  this  limit.  As  in  the  unicast  case,  the  only  constraints  in  (5.1) 
that  can  be  tight  are  at  level  t  =  log(n).  This  results  in  the  following  approximate 
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multicast  capacity  region  for  dense  networks: 


Xrc(«)  A  {amc  e  MJ*2"  :  5]  +  X  5]  A$,  <  1,  Vi,  6  V»}, 

WcV{n):  u^u  WcV(n): 

W\{u}j^$  u£W 

and  we  obtain  that  for  dense  networks,  for  any  a  >  2, 

n-°(1)A^c(n)  C  AMC(n)  C  0(log6(n))A^c(n). 

Note  that,  in  contrast  to  the  extended  case,  this  results  in  an  approximate  character¬ 
ization  of  the  entire  unicast  region  for  all  a  >  2. 

5.5.2  Design  Guidelines 

Several  design  guidelines  for  multicast  traffic  in  large  wireless  networks  can  be  ob¬ 
tained  from  the  results  presented  in  this  chapter.  First,  observe  that  the  only  modifi¬ 
cation  to  the  three-layer  architecture  introduced  in  Chapter  4  for  unicast  traffic  is  in 
the  routing  layer.  This  suggests  that  multicasting  in  wireless  networks  can  be  handled 
by  appropriate  routing  alone.  In  particular,  for  balanced  multicast  traffic  or  a  >  5, 
cross-layer  techniques  such  as  network  coding  (which  can  provide  an  arbitrarily  large 
increase  in  achievable  rates  in  wireline  networks)  are  not  necessary  for  order-optimal 
transmission  of  multicast  traffic  in  wireless  networks,  and  can  provide  at  most  a  factor 
n°W  increase  in  achievable  rates. 

Second,  note  that  the  problem  of  finding  multicast  trees  in  wireless  networks  is 
solved  very  efficiently  in  our  proposed  three-layer  architecture.  Namely,  we  map  the 
wireless  network  to  the  graph  G  and  then  look  for  multicast  trees  on  G.  Since  G 
is  a  tree,  finding  the  optimal  multicast  trees  in  G  is  trivial.  The  optimality  of  the 
three-layer  architecture  in  the  scaling  sense  for  balanced  traffic  or  a  >  5  suggests  this 
approach  for  multicast  routing  in  large  wireless  networks. 

Finally,  we  point  out  that  this  approach  of  finding  optimal  multicast  trees  in  G 
is  also  useful  in  streaming  applications,  in  which  the  nodes  that  are  subscribed  to 
a  certain  multicast  stream  can  vary  over  time.  In  fact,  the  optimal  multicast  tree 
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connecting  source  u  with  multicast  group  W  in  G  is  G({w}  U  W)  (i.e.,  the  smallest 
subtree  of  G  connecting  all  nodes  in  {w}  U  W).  Now,  note  that  G({u}  U  W)  has  the 
property  that  if  W  =  {wi, . . . ,  wm}  then 

m 

G({u}UW)  =  \jG({u,wi}), 

i= 1 

i.e.,  the  optimal  multicast  tree  connecting  the  source  u  with  its  subscribers  W  can 
be  constructed  as  the  union  of  all  the  paths  between  u  and  Wi.  This  implies  that  if 
a  new  node  subscribes  to  a  stream  available  at  u,  the  optimal  multicast  tree  can  be 
updated  efficiently  by  just  adding  this  new  path  (and  similar  if  a  node  unsubscribes 
from  a  stream). 

5.6  Chapter  Summary 

In  this  chapter,  we  have  obtained  inner  and  outer  bounds  for  the  n  x  2n-dirriensional 
multicast  capacity  region  of  a  wireless  network  with  n  randomly  placed  nodes  and 
assuming  a  Gaussian  fading  channel  model.  These  inner  and  outer  bounds  coincide 
(up  to  scaling)  along  at  least  n2n  —  n  out  of  n2n  dimensions  for  a  G  (2,  5]  and  in  all 
ri2n  dimensions  for  a  >  5.  As  in  the  unicast  case,  this  approximate  characterization 
of  the  multicast  capacity  region  is  in  terms  of  2 n  weighted  cuts.  This  provides  a 
compact  approximate  representation  of  the  multicast  capacity  region. 

We  have  shown  how  the  three-layer  communication  architecture  introduced  for 
general  unicast  traffic  can  be  modified  for  multicast  traffic.  Out  of  the  three  layers, 
only  the  top  or  routing  layer  needed  to  be  changed  —  the  other  layers  operate  as  in 
the  unicast  case. 

This  scheme  also  establishes  that  a  separation  based  approach,  where  the  routing 
layer  works  essentially  independently  of  the  physical  layer,  is  order  optimal  for  bal¬ 
anced  traffic  or  when  a  >  5.  Thus,  such  techniques  as  network  coding  can  provide  at 
most  a  small  increase  in  the  scaling. 
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Chapter  6 


Service  Heterogeneity:  Caching 


In  this  chapter,  we  analyze  the  scaling  of  the  2n  x  n-dimensional  caching  capacity 
region  ACA(n)  under  random  node  placement.  We  present  an  achievable  communica¬ 
tion  scheme  for  the  caching  problem,  yielding  an  inner  bound  on  the  caching  capacity 
region.  For  large  values  of  path- loss  exponent,  we  provide  a  matching  (in  the  scal¬ 
ing  sense)  outer  bound,  proving  the  optimality  (again  in  the  scaling  sense)  of  our 
proposed  scheme.  Together,  this  provides  a  scaling  description  of  the  entire  caching 
capacity  region  of  the  wireless  network  in  the  large  path-loss  regime.  The  proposed 
communication  scheme  solves  the  problem  of  optimal  cache  selection  and  channel 
coding  separately,  showing  that  such  a  separation  is  order-optimal. 


6.0.1  Organization 

The  remainder  of  the  chapter  is  organized  as  follows.  In  Section  6.1,  we  present  the 
main  results  of  this  chapter.  We  analyze  several  example  scenarios  in  Section  6.2. 
In  Section  6.3,  we  introduce  the  communication  scheme  achieving  the  inner  bound 
on  ACA(n).  Section  6.4  contains  proofs.  Sections  6.5  and  6.6  contain  discussions  and 
concluding  remarks. 
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6.1  Main  Results 


In  Section  6.1.1,  we  provide  an  inner  and  a  matching  (in  the  scaling  sense)  outer 
bound  on  the  capacity  region  ACA(n).  In  Section  6.1.2,  we  discuss  computational 
aspects. 


6.1.1  Caching  Capacity  Region 

Recall  the  construction  of  the  graph  G  =  (Vg,  Eg)  introduced  in  Section  4.3.  G  is  a 
tree  with  leaf  nodes  V (n)  C  Vq.  Leaf  nodes  in  G  share  the  same  parent  node  in  G  if 
they  fall  within  the  same  grid  square  at  level  L{n)  in  A{n)  (with  L{n )  as  defined  in 
(4.1)).  Nodes  at  level  t  in  the  tree  G  share  the  same  parent  node  if  all  their  children 
fall  in  the  same  grid  square  at  level  l  —  1  in  A{n).  This  construction  is  illustrated  in 
Figure  4-3  in  Chapter  4.  As  in  Chapter  4,  we  assign  to  each  edge  e  G  Eq  at  level  i 
in  G  (i.e.,  between  nodes  at  levels  l  and  £  —  1)  a  capacity 

(4- £n)2-min{3,Q}/2  if  1  <  t  <  L(n), 

1  if  £  =  L(n)  +  1. 

With  slight  abuse  of  notation,  we  let  for  (w,  v)  =  e  G  EG 

A_ 

Cu,v  ^e* 

As  we  shall  see  in  the  following,  the  caching  capacity  region  ACA(n)  is  closely 
related  to  the  following  quantity: 

Ac\n)^(\CAeR2;xn-  E  E  E  Cu,vVScVG 

^  UcSnV(n)w£V(n)\S  (u,v)&Ea-. 

ueS,v(£S 

The  region  A CA(n)  is  described  by  various  subsets  S  C  VG.  Each  such  subset  can  be 
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understood  as  a  cut  in  the  graph  G.  For  every  cut  S  C  Vq,  the  sum  rate 

UaSrV(n)  wOV(n)\S 

between  nodes  in  S  and  Sc  (i.e.,  across  the  cut)  is  bounded  by  the  sum  capacity 


(u,v)eEc'- 

uES,v(£S 

of  edges  between  S  and  Sc.  Note  that  we  only  count  traffic  AjFA„  such  that  all  caches 
U  are  contained  in  S. 

The  first  result  states  that  for  all  a  >  2,  ACA(n)  is  an  approximate  inner  bound 
to  the  caching  capacity  region  ACA(n). 

Theorem  6.1.  Under  either  fast  or  slow  fading,  for  any  a  >  2,  there  exists  bi(n)  > 
n -0('1)  such  that 

6i(n)ACA(n)  C  ACA(n) 
with  probability  1  —  o(l)  as  n  oo. 

We  point  out  that  Theorem  6.1  holds  only  with  probability  1  —  o(l)  for  different 
reasons  in  the  fast  and  slow  fading  case.  For  fast  fading,  the  theorem  holds  only  for 
node  placements  that  are  “regular”  enough.  A  random  node  placement  satisfies  these 
regularity  conditions  with  high  probability  as  n  — >  oo.  For  slow  fading,  Theorem  6.1 
holds  under  the  same  regularity  conditions  on  the  node  placement,  but  moreover  only 
holds  for  almost  all  realizations  of  the  channel  gains. 

The  next  result  provides  an  approximate  matching  outer  bound  to  ACA(n)  for 
large  values  of  path-loss  exponent  a  >  6. 

Theorem  6.2.  Under  either  fast  or  slow  fading,  for  any  a  >  6,  there  exists  b2[n )  < 
n° such  that 

A CA(n)  C  62(«)ACA(n) 
with  probability  1  —  o(l)  as  n  — >  oo. 
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Figure  6-1:  For  a  >  6,  the  set  ACA(n)  approximates  the  caching  capacity  region 
ACA(n)  of  the  wireless  network  in  the  sense  that  bi(n)ACA(n)  (with  b\(n)  >  n~°^) 
provides  an  inner  bound  to  A CA(n)  and  &2('^)ACA(n)  (with  b2(n )  <  n°^)  provides  an 
outer  bound  to  A CA(n).  The  figure  shows  two  dimensions  (namely  A^S.  2  and  3) 
of  the  2”  x  n-dimensional  sets  ACA(n)  and  ACA(n). 

As  Theorem  6.1,  Theorem  6.2  holds  only  with  high  probability  due  to  regularity 
conditions  on  the  node  placement.  However,  unlike  Theorem  6.1,  Theorem  6.2  holds 
for  all  realizations  of  channel  gains  also  for  the  slow  fading  case. 

Comparing  Theorems  6.1  and  6.2,  we  see  that  for  a  >  6,  the  caching  capacity 
region  A CA(n)  is  approximately  equal  to  A CA(n)  in  the  sense  that 

n-o(i)^CA(n)  c  ACA(n)  c  no(i)ACA(n). 

In  other  words,  for  a  >  6,  ACA(n)  scales  as  the  caching  capacity  region  A(n).  This  is 
illustrated  in  Figure  6-1. 

6.1.2  Computational  Aspects 

As  argued  in  previous  chapters,  since  we  are  interested  in  large  networks,  compu¬ 
tational  aspects  are  a  concern.  Note  that  the  approximate  caching  capacity  region 
ACA(n)  is  described  in  terms  of  essentially  @(4n)  cuts  S  C  Vq-  We  show  in  Example 
6.1  in  Section  6.2,  that  a  description  with  significantly  fewer  cuts  is  not  possible.  In 
other  words,  even  an  approximate  description  A CA(n)  of  the  caching  capacity  region 
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ACA(n)  is  computationally  intractable  for  large  values  of  n. 

On  the  other  hand,  consider  the  simpler  problem  of  testing  membership  of  Aca  € 
ACA(n).  We  now  argue  that  this  problem  can  be  approximately  solved  in  an  efficient 
manner.  More  precisely,  we  show  that  ACA  G  A CA(n)  can  be  checked  approximately 
in  polynomial  time  in  the  description  complexity  of  ACA.  Combined  with  Theorem 
6.1  and  6.2,  this  shows  that  for  a  >  6  approximate  membership  ACA  G  ACA(n)  can  be 
checked  efficiently  as  well. 

Formally,  define  for  any  caching  traffic  matrix  ACA  G  R+"xn 
Paca(u)  =  sup{p  >  0  :  pACA  G  ACA(n)}. 

Membership  ACA  G  ACA(n)  can  then  be  evaluated  by  checking  if  pxc/\(n)  <  1.  Let 
0aca  (n)  to  be  the  solution  to  the  following  linear  program 


max  0 


s.t. 

^  ^  fp,U,w 

>  <t>^% 

VC/  C  V{n),w  G  V(n), 

P&Pu,w 

E  E 

^  ^  fp,U,w 

V  eeEG, 

p£P:eSp  UcV(n) 

w£V(n) 

fp,U,w 

>  0 

VU  cV(n),weV(n),pePUtW 

where  Pu>w  is  the  path  in  G  from  node  u  to  node  w  (since  G  is  a  tree,  there  is  only 
one  such  paths),  and  where 


P  — 

*U,w  — 


=  U 


P, 


ueu 


p=  U  U 

UcV(n )  weV(n) 


Note  that  the  linear  program  (6.1),  and  hence  also  0aca (n),  can  be  evaluated  in 
polynomial  time  in  the  description  length  of  Aca  (i.e.,  in  polynomial  time  in  the 
length  of  the  “input”  of  the  linear  program)  by  setting  the  flow  variables  fPtu,w  to 
zero  whenever  A^A,  =  0  and  p  G  Pu,w  Moreover,  using  a  primal-dual  algorithm,  (6.1) 


177 


can  be  solved  efficiently  in  a  distributed  manner  (see,  for  example,  [41,  Chapter  3.7]). 

The  following  theorem  shows  that  0aca (n)  is  a  good  approximation  to  paca(ti). 

Theorem  6.3.  Under  either  fast  or  slow  fading,  for  any  a  >  2,  there  exists  63  > 
n-o(i)  suc^  piaf  jQr  any  n  (m(j  cac}nng  traffic  matrix  ACA  G  M+ xn 

b3(n)pxc*(n)  <  0aca (n)  <  pxc*(n). 

As  argued  above,  0aca  (n)  can  be  computed  in  polynomial  time  in  the  description 
length  of  Aca.  Hence  Theorem  6.3  shows  that  testing  membership  ACA  G  A CA(n)  can 
be  done  approximately  in  polynomial  time  in  the  description  length  of  ACA.  Combined 
with  Theorems  6.1  and  6.2  this  implies  that,  for  a  >  6,  approximate  achievability  of  a 
traffic  matrix  ACA  (i.e.,  testing  membership  ACA  G  ACA(n))  can  be  checked  efficiently 
and  in  a  distributed  fashion. 


6.2  Example  Scenarios 

Here  we  provide  three  examples  illustrating  various  aspects  of  the  caching  capacity 
region.  Example  6.1  shows  that  the  capacity  region  for  caching  is  inherently  more 
complicated  than  the  ones  resulting  from  unicast  or  multicast  traffic.  Example  6.2 
shows  that  the  strategy  of  always  selecting  the  nearest  cache  can  be  arbitrarily  bad. 
Example  6.3  analyzes  the  impact  of  complete  caches  on  the  performance  of  the  wireless 
network. 

Example  6.1.  Insufficiency  of  edge  cuts 

For  unicast  traffic  and  multicast  traffic,  we  have  seen  in  chapters  4  and  5  that  it 
is  sufficient  to  consider  edge  cuts  in  G,  i.e,  cuts  that  result  from  removing  a  single 
edge  from  G.  By  construction,  G  has  at  most  2 n  edges,  and  hence  there  are  at  most 
2 n  such  edge  cuts.  This  contrasts  with  the  situation  for  caching  traffic,  for  which 
Theorems  6.1  and  6.2  indicate  that  we  have  to  consider  general  cuts,  i.e.,  arbitrary 
subsets  S  of  Vq.  Indeed,  the  approximate  capacity  region  ACA(n)  is  expressed  in 
terms  of  essentially  0(4n)  cuts.  Comparing  these  two  results,  one  might  suspect  that 
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Figure  6-2:  Caching  traffic  pattern  for  Example  6.1. 

a  simpler  characterization  in  terms  of  edge  cuts  can  be  found  for  the  caching  capacity 
region  as  well.  This  example  shows  that  this  is  not  possible.  In  other  words,  the 
caching  capacity  region  is  inherently  more  complicated  than  the  unicast  or  multicast 
capacity  region  of  a  wireless  network. 

Assume  V2ii  (n)  and  V2j2 (n)  are  subsets  of  V\ ,i(n),  and  consider  two  nodes  u \  e 
V2li  (n),  u2  G  V2 ,2(n).  Construct 

\CA  A  I  p(n)  if  U  =  {ui,u2},w  e  Cl >2(n), 

Ki,w  —  ^ 

I  0  else, 

for  some  p(n)  >  0.  This  is  illustrated  in  Figure  6-2. 

The  best  edge  cut  results  from  removing  edge  e  in  Figure  6-2.  The  cut  capacity 
is  ce  =  n2~mm{3’a}/2  and  the  sum  rate  across  the  cut  is  \V\j2(n)\p(n).  By  Theorem  6.2 
and  for  a  >  6,  this  shows  that  p*(n ),  the  largest  achievable  value  of  p(n),  is  upper 
bounded  as 


p*(n)  <  |Ci,2(n)r1n2-min{3’a}/2+0(1)  =  n1-min{3.«}/2+°(i) 

with  high  probability. 

On  the  other  hand,  consider  the  general  node  cut  S  =  {ui,u2}.  The  cut  capacity 
here  is  2  and  the  sum  rate  across  the  cut  is  again  \Vij2(n)\p(n).  Moreover,  it  is  easily 
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checked  that  S  is  the  bottle  neck  cut  in  G.  Thus  Theorem  6.1  shows  that  p*(n)  is 
lower  bounded  as 

p*(n)  >  n"1_o(1),  (6.2) 

and,  for  a  >  6,  Theorem  6.2  shows  that 

p*(n)  <  tj,-1-0!1). 

In  this  example,  it  can  be  shown  that  the  correct  scaling  of  p*(n)  is  actually 

p*{n)  =  n_1±0^^ 

for  all  a  >  2  (not  just  a  >  6  as  suggested  by  Theorem  6.2).  Note  that  this  differs 
substantially  from  the  upper  bound  obtained  from  the  best  edge  cut  (6.2).  0 

Example  6.2.  Nearest  neighbor  cache  selection 

A  reasonable  strategy  of  selecting  caches  is  to  request  the  entire  message  from  the 
nearest  available  cache.  In  fact,  this  is  the  strategy  implicitly  assumed  in  most  of 
the  prior  work  considering  caching  in  wireless  networks  cited  in  Section  1.2.4.  This 
example  shows  that  this  strategy  can  be  arbitrarily  bad. 

Assume  V2,i(n)  and  V2:2(n)  are  subsets  of  Viti(n),  and  V2j3 (n)  is  a  subset  of  V\ ,2(n). 
Consider  a  node  u*  G  V2j2('i n),  and  label  the  nodes  in  V2;i (n)  =  {w±,w2, . . .}  and  in 
1/2,3 (n)  =  {ui,u2, . . .}.  Construct 

{p(n)  if  U  =  {u*,Ui},w  =  Wi  for  some  i , 

0  else, 

for  some  p(n)  >  0.  This  is  illustrated  in  Figure  6-3. 

For  every  wt ,  the  nearest  cache  is  u*.  Requesting  the  entire  message  from  it 
produces  a  unicast  traffic  pattern  resulting  in  a  per-node  rate  of  at  most 

p(n)  <  n-1+o(1) 
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Figure  6-3:  Caching  traffic  pattern  for  Example  6.2. 
for  all  a  >  2  (see  Chapter  4). 

Assume  now  each  wy  uses  only  the  more  distant  cache  .  This  achieves  a  value 
of  p{n)  of 

p(n)  >  n1-min{3>«}/2-o(i)  ^  n-i+o(i). 

Applying  Theorem  6.1  yields  the  same  77.1- mm{3,0!} /2— “^t1)  value  of  p(n),  and  Theorem 
6.2  confirms  that,  for  a  >  6,  no  scheme  can  achieve  a  better  scaling.  Hence 

p*(n)  = 

for  a  >  6,  and,  as  in  the  previous  example,  it  can  be  shown  that  this  is  the  correct 
scaling  of  p*(n)  also  for  a  G  (2,6].  This  shows  that  the  strategy  of  always  selecting 
the  nearest  cache  can  result  in  a  scaling  exponent  that  is  considerably  worse  than 
what  is  achievable  with  optimal  cache  selection.  0 

Example  6.3.  Complete  caches 

Assume  we  randomly  pick  rr  caches  for  [3  G  [0, 1),  each  holding  a  complete  copy 
of  all  the  messages.  More  precisely,  letting  W  =  be  the  collection  of  caches, 

we  consider  a  caching  traffic  matrix  ACA  G  xn  of  the  form 


A 


CA  _ 

W,  v  ~ 


j  p(n )  if  W 
I  0  else, 
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for  some  p(ri)  >  0.  In  this  setup,  choosing  the  nearest  cache  strategy  (as  discussed  in 
Example  6.2)  results  in  a  per-node  rate  of 

p{n )  > 

with  probability  1  —  o(l)  as  n  — >  oo.  The  three-layer  architecture  proposed  in  The¬ 
orem  6.1  achieves  the  same  rate,  and  Theorem  6.2  shows  that,  for  a  >  6,  for  any 
communication  scheme 

p{n)  <  n/3-1+°(1). 

Hence,  for  a  >  6, 

p*{n)  =  n^-1±o(1), 

and  it  can  be  shown,  as  in  the  previous  two  examples,  that  this  is  the  correct  scaling 
of  p*(n)  also  for  a  G  (2,  6]. 

This  example  illustrates  that  when  in  situations  in  which  the  traffic  demand  and 
location  of  caches  is  regular  enough,  the  strategy  of  selecting  the  nearest  cache  can 
actually  be  close  to  optimal.  0 


6.3  Communication  Scheme  for  Caching  Traffic 

Theorem  6.1  provides  an  inner  bound  to  the  caching  capacity  region  of  a  wireless 
network.  Here  we  describe  the  communication  scheme  achieving  this  inner  bound. 
The  matching  outer  bound  shows  that,  for  a  >  6,  this  scheme  is  optimal  in  the  scaling 
sense. 

The  communication  architecture  uses  the  same  three  layer  structure  introduced  in 

4.3  for  unicast  traffic.  Recall  that,  from  high  to  low  level  of  abstraction,  these  layers 
are  the  routing  layer,  cooperation  layer,  and  physical  layer.  Out  of  these  three  layers, 
only  the  routing  layer  needs  to  be  adapted  for  the  transmission  of  caching  traffic. 

From  the  view  of  the  routing  layer,  the  wireless  network  consists  of  the  noiseless 
capacitated  tree  graph  G  (see  Section  6.1.1  and  Figure  4-3).  To  send  a  message  at 
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Figure  6-4:  Example  operation  of  routing  layer  of  the  three-layer  architecture  for 
caching  traffic. 

the  caches  U  to  its  destination  w,  the  routing  layer  routes  the  message  over  G.  The 
optimal  requests  of  message  parts  from  the  caches  in  U  (i.e.,  optimal  cache  selection) 
are  found  by  solving  the  linear  program  (6.1).  As  pointed  out  in  Section  6.1.2,  this 
optimal  cache  selection  can  be  performed  efficiently  by  a  distributed  algorithm.  Note 
that  this  contrasts  with  the  routing  operation  in  the  unicast  and  multicast  cases.  For 
unicast  and  multicast  traffic,  routing  was  trivial,  whereas  optimal  routing  for  caching 
traffic  is  more  complicated. 

The  next  example  illustrates  the  operations  of  the  routing  layer  under  caching  traf¬ 
fic.  For  more  details  on  this  architecture  (in  particular  the  cooperation  and  physical 
layer),  see  Section  4.3. 

Example  6.4.  We  consider  a  single  ( U,w )  pair.  Here,  the  set  of  caches  U  consist  of 
the  nodes  {ui,u2,u3}  in  the  wireless  network  and  their  destination  w  is  in  the  top 
right  of  the  network,  as  shown  in  Figure  6-4.  Optimal  cache  selection  is  performed 
by  solving  a  linear  program.  Note  that,  in  general,  each  cache  delivers  only  a  part  of 
the  message  (i.e.,  multiple  caches  are  involved  in  communicating  the  message  to  its 
destination).  0 

6.4  Proofs 

This  section  contains  the  proofs  of  Theorems  6.1,  6.2,  and  6.3.  We  start  in  Section 
6.4.1  with  some  auxiliary  results.  Sections  6.4.2,  6.4.3,  and  6.4.4  contain  the  proofs 
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of  Theorems  6.3,  6.1,  and  6.2,  respectively. 


6.4.1  Auxiliary  Results 

In  this  section,  we  define  some  quantities  needed  in  several  of  the  proofs. 

We  first  introduce  a  “dual”  description  of  the  various  regions.  Recall  that  for  any 
caching  traffic  matrix  ACA  £  R^Txn 

PAca(u)  =  sup  {p  >  0  :  p\CA  £  ACA(n)}. 

Define  similarly 

Paca(u)  =  sup  {p  >  0  :  p\CA  £  ACA(n)}. 

Consider  a  caching  traffic  matrix  ACA  £  xn  for  the  wireless  network  and  note  that 
ACA  can  equivalently  be  treated  as  a  traffic  matrix  between  the  leaf  nodes  of  the  graph 
G  introduced  in  Section  6.1.  Let  A^A(n)  C  M+nxn  be  the  collection  of  such  caching 
traffic  matrices  ACA  £  R+  xn  that  can  be  transmitted  over  G  using  routing.  Note  that 
0aca(u)  as  defined  through  the  linear  program  (6.1)  is  equal  to 

0aca(u)  =  sup  {(/>  >  0  :  (j) Aca  £  A^.A(n)}. 

It  can  be  shown  that  the  regions  ACA(n),  ACA(n),  and  A^.A(n)  are  convex,  and  hence 
knowledge  of  paca(u),  paca(u),  and  0aca(u)  for  every  ACA  £  M^xn  is  sufficient  to 
completely  describe  them. 

Finally,  recall  the  definition  of  the  set  V(n)  of  node  placements  V(n)  that  satisfy 
certain  regularity  conditions  as  defined  in  Section  4.4.1. 

6.4.2  Proof  of  Theorem  6.3 

We  first  prove  the  upper  bound,  i.e., 


</>aca  <  pA ca. 


(6.3) 
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Note  that  if  ACA  G  AqA  then  there  exists  a  strategy  to  route  traffic  at  rates  ACA  over  G. 
This  implies  that  the  flow  across  each  cut  S  C  Vq  must  be  less  than  the  capacity  of 
that  cut.  The  flow  across  such  a  cut  S  contains  at  least  all  those  requested  messages 
that  have  all  their  caches  in  S  and  their  destination  in  Sc,  i.e., 


U<zSrV(n)  wev(n)\s 

On  the  other  hand,  the  capacity  of  the  cut  S  is  equal  to 


( u,v)eEc : 
uES,v£S 


Thus  Aca  G  AqA  implies 


y:  ^ 

UaSrV{n)  w&V(n)\S 


(u,v)eEc'- 

uES,v(£S 


and  hence  ACA  G  ACA.  Therefore  AqA  C  ACA,  from  which  (6.3)  follows. 

We  now  prove  the  lower  bound,  i.e.,  we  show  that  there  exists  b3 (n)  >  such 

that  for  any  ACA 

0 ACA  >  b3(n)px CA.  (6.4) 

Pick  any  ACA.  Since  for  any  b  >  0, 


06A«  =  -<f>  aca, 

1  . 

PbACA  —  ^PACA, 


we  may  assume  without  loss  of  generality  that 


E  AgA„  =  1.  (6.5) 

(U,  w) 

Recall  that  G  is  an  undirected  capacitated  graph.  Construct  a  directed  capacitated 
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Figure  6-5:  Construction  of  the  directed  graph  G  from  the  undirected  graph  G. 


graph  G  =  (Vq,  Eq)  as  follows.  Take  the  undirected  graph  G  and  turn  it  into  a 
directed  graph  by  splitting  each  edge  e  G  Eg  into  two  directed  edges  each  with  the 
same  capacity  as  e.  Add  2n  additional  nodes  to  Vq,  one  for  each  subset  U  C  V. 
Connect  the  new  node  u  corresponding  to  U  to  each  node  u  E  U  by  a  (directed)  edge 
(u,  u )  with  Cu)U  =  oo.  This  procedure  is  illustrated  in  Figure  6-5.  We  call  the  directed 
version  of  G  that  is  contained  in  G  as  a  subgraph  its  core.  Note  that  if  some  flows 
can  be  routed  through  G  then  the  same  flows  can  be  routed  through  the  core  of  G, 
and  if  some  flows  can  routed  through  the  core  of  G  then  at  least  half  of  each  flow  can 
be  routed  through  G.  ffence,  for  scaling  purposes,  the  two  are  equivalent. 

Now,  assume  we  are  given  a  caching  traffic  matrix  ACA  for  G.  Construct  a  unicast 
traffic  matrix  Auc  for  G  by  making  for  each  (U,  w)  pair  in  G  (i.e.,  U  C  V ,  w  G  V)  the 
node  u  in  G  corresponding  to  U  a  source  for  w  with  rate 

Auc  =  Aca 

/'u,w  AU,w 

Denote  by  A~c  the  set  of  feasible  such  unicast  traffic  matrices  for  G ,  and  set 

fee  —  sup  {(p  >  0  :  0AUC  G  A~c}. 

Since  the  edges  connecting  the  nodes  in  Vq  \  Vq  to  the  core  of  G  are  in  only  one 
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direction,  and  by  the  above  argument  relating  G  to  the  core  of  G,  we  have 


0 ACA  >  -0AUC- 


(6.6) 


We  are  thus  left  with  the  problem  of  analyzing  unicast  traffic  over  G.  Two  dif¬ 
ficulties  arise.  First,  G  is  a  directed  graph.  While  unicast  traffic  over  undirected 
graphs  with  m  nodes  are  well  understood  and  0(log(m))  approximation  results  for 
the  capacity  region  of  such  graphs  in  terms  of  cut-set  bounds  are  known  [34],  the 
best  known  approximation  result  for  general  directed  graphs  is  (up  to  polylog  fac¬ 
tors)  0(mu Z23)  [2],  Second,  the  graph  G  is  exponentially  big  in  n.  More  precisely, 
\Vq\  >  2".  Hence  even  a  logarithmic  (in  the  size  m  of  the  graph)  approximation 
result  will  only  yield  a  polynomial  approximation  in  n.  Nonetheless,  as  we  shall  see, 
the  special  structure  of  G  can  be  exploited  to  obtain  log(n)  approximation  results  of 
A«c 


We  use  an  idea  from  [22],  namely  that  the  unicast  traffic  problem  can  be  reduced 
to  a  maximum  sum  rate  problem.  More  precisely,  for  a  subset  F  C  Vq  x  Vq  of  (u,  w) 
pairs  in  G,  define  the  maximum  sum  rate  as 

ap  =  sup  :  ^UC  eAf}- 

We  now  argue  that  for  every  unicast  traffic  matrix  Auc  there  exists  F  such  that  ap 
is  not  too  much  bigger  than 

First,  note  that  (p^uc  is  the  solution  to  the  following  linear  program 

maximize  <p 

subject  to  Epeiy,™  h  >  fXu% 

S peP-.eepfp  —  Ce 

fp  >0 


V  u,  w  G  Vq, 
Ve  G  Eq, 

V  Pe  P, 


(6.7) 
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where  Pu.u,  is  the  collection  of  all  paths  in  G  from  node  u  to  node  w,  and 


P=  U  Pu, 

C u,w)evsxvs 

The  corresponding  dual  linear  program  is 


minimize 

See  Eg  ceme 

subject  to 

EeePme 

A  dUjW 

V  u,  w  G  VG,P  G  Pu,w, 

V  rl  \UC 

£—iu,w€.Vq  Uu> “N w 

>  1 

rne 

>  0 

V  e  G  Eg, 

du,w 

>  0 

V  u,w  G  Vq. 

Since  the  all- zero  solution  is  feasible  for  the  primal  program  (6.7),  strong  duality 
holds. 

Second,  dp  is  the  solution  to  the  linear  program 


maximize 

S(u,ui)ei7’  SpePUjW  fp 

subject  to 

^-^pGPieGp  f P 

<  ce  Ve  G  Eq, 

fp 

>0  V  p  G  P, 

and  its  dual  is 

minimize 

eG  Ee  ce’me 

subject  to 

J^eGp  —  du,w 

V  u,w  G  Vq,  p  G  Pu, 

du,w  —  1 

V  (u,  w)  G  F, 

me  >  0 

V  e  G  Eg, 

dUjW  A  0 

V  u,w  G  Vg. 

Again  strong  duality  holds. 

Let  {rn*e}e£Eg,  {du,w}u,wev5  be  a  minimize!'  for  the  dual  (6.8)  of  the  unicast  traffic 
problem.  We  now  show  how  {ml},  {d*u  can  be  used  to  construct  a  solution  to  the 
dual  (6.9)  of  the  maximum  sum  rate  problem.  Note  that  we  can  assume  without  loss 
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of  optimality  that 


1 0  if  =  0, 

{  minPePu,w^eepm*e  else- 


(6.10) 


Now,  since  ce  =  oo  whenever  e  G  Eq  \  EG)  we  have  rn*  =  0  for  those  edges.  Since  in 
addition  A^,  >  0  only  if  u  G  Vq  \  Vq,  this  implies  that  {d^  w}u,wev^  can  take  at  most 
n2  different  nonzero  values.  Order  these  values  in  decreasing  order 


d\  >  d*2  >  . . .  >  d*K  >  d*K+1  =  0 


with  K  <  n2,  and  dehne 

\UC  a  V"  \UC 

Ak  —  Au,W 

u,weV^:d*tW=dl 

We  now  argue  that  d*k  <  n2  for  all  k  G  {1, . . . ,  A'}.  In  fact,  assume  d\  >  n2,  then 
by  (6.10)  there  exists  at  least  one  edge  e  such  that  >  n.  Hence 

'Y,  ceml  >  Ceinl  >  n 

e£EG 

since  ce  >  1  for  all  e  G  Eq.  On  the  other  hand,  let  me  =  1  for  all  edges  between  the 
leave  nodes  and  parent  nodes  in  the  core  of  G ,  and  let  me  =  0  for  all  other  edges.  Set 
dUjW  as  in  (6.10)  but  with  respect  to  {me}.  Since  all  paths  between  node  pairs  ( u,w ) 
such  that  >  0  include  at  least  one  edge  between  the  aforementioned  leave  and 
parent  nodes,  we  have 


rl  XUC 

UjU,W/'u,W 


u,w£Vq 

by  the  normalization  assumption  (6.5). 
(6.8),  and  has  value 


>  E  = 

u,w &Vq 

Thus  {me},  {dUjV}  is  feasible  for  the  dual 


Y  ceme  =  n  <  Y  ce'm*e, 
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contradicting  the  optimality  of  {m*},  {d*u  Hence  d*k<  d*x<  n 2  for  all  k. 


We  now  argue  that  at  least  one  d*k  is  not  too  small.  Let  k\  <  &2  <  . . .  <  ki  be 
such  that 


{ki}i=1  =  \k:  Xk  >^\- 


(6.11) 


Note  that  I  >  1  since  otherwise 


K+ 1 

v 

u,w£Vq  k= 1 

<  (A'  +  1) 


2n4 


< 


rr  +  1 


2n4 


<  1, 


contradicting  the  normalization  assumption  (6.5).  Finally,  define  for  i  <  I, 


l=i 

Using  that  {d£}  is  feasible  for  the  dual  (6.8),  that  d*k  <  n2,  and  that  K  <  n2,  we  have 


y4Auc  >  i 


uc 

fc 


i=l 


>  1  - 
1 

>  -. 

“  2 


fc:A^c<l/2n4 

1  ATn2 


2n4 


(6.12) 


We  argue  that  this  implies  existence  of  i  such  that 


dt  > 


2sj(l  +  ln(2n4)) 


(6.13) 
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Indeed,  assume  (6.13)  is  false  for  all  i.  Then 


I  \UC 


uc 

hi 


< 


1=1 


_ i _ \  ^ 

2(1  +  ln(2n4))  st 


i  +  E 


< 


< 


2(1  +  ln(2  n4)) 
1 

2(1  +  ln(2n4)) 
1 

2(1  +  ln(2n4)) 
1 

2(1  +  ln(2n4)) 
1 

2’ 


i=2 

I 


1  +  y:  (ln(si)  -  ln(sj_i)) 

i=2 

(1  +  ln(s7/A^c))) 

(l  +  ln(2n4)) 


(6.14a) 

(6.14b) 

(6.14c) 


where  we  have  used  that  /  >  1  in  (6.14a),  that  1  —  x  <  —  ln(x)  for  every  x  >  0  in 
(6.14b),  and  that  s7  <  1  by  (6.5)  and  A^0  >  in  (6.14c).  This  contradicts  (6.12), 
showing  (6.13)  must  hold  for  some  i.  Consider  this  value  of  i  in  the  following. 


Now,  consider  the  following  set  F  of  (u,  w )  pairs: 


F  4  {(«,«.)  :<„><}• 


Note  that  by  (6.10)  we  have  that  if  (u,w)  G  F  then  u  G  Vg\Vc  and  w  is  a  “leaf” 
node  in  Vq.  In  other  words,  (u,  w)  pairs  in  F  correspond  to  caches-destination  pairs 
(U,  w )  in  G.  Set  for  all  u,  w  G  Vq  and  e  G  Eg, 


i  A_ 

UJv,.w  — 


dt 


m : 


rriP  = 


% 


Note  that  for  (u,  w)  G  F, 


j  _  U,W  ^  1 

U'u,w  _  -L) 
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1 


and  that  for  all  u,  w  G  Vq,  p  G  Pu,w, 


J2'me 

e£p 


> 


d 


ki 


E 


eSp 


—  d* 
dtd« 


d. 


ll,wi 


m 


* 

e 


by  feasibility  of  {d*u  w }  and  {m*}  for  the  dual  (6.8).  Hence,  for  this  F,  the  choice  of 
{me}  and  {du,w}  is  feasible  for  the  dual  (6.9). 


By  weak  duality 


d 


E 

ki  eS E* 


cem  . 


By  (6.13), 

d*  > _ I _ 

ki  ~  2sj(l  +  ln(2n4))  ’ 

and 


i 


3= 1 


=  E  E 


<  T  Auc 

—  /  y  u,u 

{u,w):d*tW>d,%. 


=  Ey“« 

(■ u,w)EF 


A  \UC 
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where  in  the  third  line  we  have  used  that  d*k  >  d*k2  >  . . ..  Therefore 

dp  <  2A~C(1  +  ln(2n4))  ^  cem*e. 

eGEG 

Since  {rn*}  is  optimal  for  the  dual  (6.8),  and  by  strong  duality,  we  also  have 


cern*e  —  4> AUC, 


and  hence 


0AUC  — 


(Ti 


2(l  +  ln(2n4))  yuc' 


(6.15) 


We  are  thus  left  with  analyzing  maximum  sum  rates  dp.  Now  notice  that  since 
the  edges  in  Eq  \  Eq  have  infinite  capacity,  and  since  for  (u,  w)  G  F  we  can  assume 
without  loss  of  generality  that  u  G  Vg  \  Vq,  this  analysis  can  be  done  by  considering 
only  the  core  of  G.  More  precisely,  for  a  collection  of  node  pairs  F  in  G  as  above, 
we  construct  a  collection  of  node  pairs  F  in  G  as  follows.  For  each  (u,  w)  G  F  with 
u  connected  by  G  with  nodes  U  C  Vq  C  Vg,  add  (u,  w)  to  F  for  each  u  6  h .  Denote 
by  <7p  the  maximum  sum  rate  for  F  in  G.  Since  G  is  the  undirected  version  of  the 
core  of  G,  we  have 

dp  >  of-  (6.16) 


For  a  collection  of  node  pairs  F  in  G,  we  call  a  set  of  edges  M  a  multicut  for  F 
if  in  the  graph  (Vq,  Eg  \  M )  each  pair  in  F  is  disconnected.  For  a  subset  M  C  EG, 
define 

Cm  ^  ^  ce. 

eSM 

It  is  shown  in  [13,  Theorem  8]  that  if  G  is  an  undirected  tree,  then  for  every  F  G 
Vq  x  Vq  there  exists  a  multicut  M  for  F  such  that 


1 

<Tf  >  - cm- 


(6.17) 


Combining  (6.15),  (6.16),  and  (6.17),  we  obtain  that  for  every  Auc  there  exists  a 
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collection  of  node  pairs  F  in  G,  and  a  multicut  M  for  the  corresponding  F  in  G  such 


that 


4> auc  > 


1  Cm 

4(1  +  ln(2n4))  A~c ' 


(6.18) 


We  now  show  how  the  edge  cut  M  C  Eg  can  be  transformed  into  a  node  cut 
S  C  Vg-  Denote  by  (5)}  the  connected  components  of  ( Vg,Eg  \  M).  We  have 


y^C(5fx5pngG  <  2 cM,  (6.19) 

i 

since  (Si  x  Sf)  fl  Eq  C  M  for  every  i,  and  since  every  edge  in  M  appears  at  most  twice 
in  the  sum  on  the  left-hand  side.  With  slight  abuse  of  notation,  define  for  S  C  Vq 


\  CA  A  \ "  \ "  \  CA 

As,sc  —  Au’w' 

ucsnv  wev\s 

M  is  a  multicut  for  the  F  induced  by  F,  and  hence  for  every  (u,  w)  G  F  and  the 
corresponding  pair  (U,w),  M  separates  w  from  all  the  nodes  in  U.  Therefore,  for 
each  such  ( U,w )  pair,  there  exists  a  St  such  that  w  G  Si,  UC\Si  —  0.  This  shows  that 


(6.20) 


Equations  (6.18),  (6.19),  and  (6.20)  imply  that  there  exists  j  such  that 


^Auc  — 


> 


Si  c(S9xSi)nEG 


8(1  +  ln(2n4))  Si  xs? & 

1  C^xS^nEc 


> 


8(1  +  ln(2n4))  A  g£s 
1 


nun 


C(SxSc)nEG 


(6.21) 


8(1  +  ln(2n4))  s cvG  A^c 

1 

8(l  +  ln(2n4))PACA' 
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Combined  with  (6.6),  this  shows  that  for 


we  have 


h(n)  = 


16(1  +  ln(2n4)) 


>  n 


-0(1) 


0ACA  >  b3(n)px CA, 


proving  the  lower  bound  in  Theorem  6.3. 


6.4.3  Proof  of  Theorem  6.1 

In  this  Section,  we  provide  the  proof  of  Theorem  6.1.  Instead  of  proving  the  theorem 
directly,  it  will  be  convenient  to  work  with  the  dual  description  pxo\(n)  and  pxca(u) 
introduced  in  Section  6.4.1.  The  next  theorem  proves  the  dual  version  of  Theorem 
6.1. 

Theorem  6.4.  Under  either  fast  or  slow  fading,  for  any  a  >  2,  there  exists  b\  = 
n-o(i)  suc/j  piat  probability  1  —  o(l)  as  n  — >  oo  for  any  n  and  any  caching  traffic 
matrix  ACA  G  xn 

bi{n)pxc*(n)  <  pxca (n) . 

Proof.  The  same  construction  as  in  the  proof  of  Theorem  4.10  shows  that  there  exists 
b(n )  >  n~°^  such  that  if  a  caching  traffic  matrix  ACA  can  be  routed  over  G,  then 
b{n)  ACA  can  be  communicated  reliably  over  the  wireless  network.  Formally,  if  V  G  V 
then  under  fast  fading 

b(n)<j> xca  <  px ca,  (6.22) 

and  the  same  results  holds  for  slow  fading  for  a  collection  of  channel  gains  hi  (not 
dependent  on  ACA)  with 


IP(  {  huv  }  u,vgv  eH)>  l-o(l) 


as  n 


oo. 
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Combining  (6.22),  with  Theorem  6.3  and  Lemma  4.5,  we  obtain  that  with  prob¬ 
ability 

n{K,v}en,VeV)  >l-o(l) 

as  n  —>  OG,  we  have  for  any  caching  traffic  matrix  ACA 

P\ ca  >  b(n)<f>x ca 

>  b(n)b3(n)px ca. 


Setting 


bi(n)  =  b(n)b3(n), 


and  recalling  that  63 (n)  >  n  0 ^  and  6(n)  >  n  both  uniformly  in  ACA,  concludes 
the  proof  of  Theorem  6.4.  □ 


6.4.4  Proof  of  Theorem  6.2 

In  this  Section,  we  prove  Theorem  6.2.  As  before,  it  will  be  convenient  to  work 
with  the  dual  description  paca(ti)  and  /5aca(ti)  introduced  in  Section  6.4.1.  The  next 
theorem  proves  the  dual  version  of  Theorem  6.2 

Theorem  6.5.  Under  either  fast  or  slow  fading,  for  any  a  >  2,  there  exists  b2  <  n 0(^ 
such  that  with  probability  1  —  o(l)  as  n  — >  00  for  any  n  and  any  caching  traffic  matrix 
ACA  G  M+ xn 

PAca  (n)  <  b2  (n)pAcA(n). 

We  start  with  some  auxiliary  lemmas.  For  a  subsets  S\,S2  C  V(n),  denote  by 
C(Si,S2)  the  MIMO  capacity  between  the  nodes  in  Si  and  S2.  Moreover,  denote  by 
S'!  the  nodes  in  S2  that  are  at  distance  between  k  and  k  +  1  from  Si,  i.e., 

=  {v  G  S2  :  min  ru  v  G  [k,  k  +  1)}. 

u£Si 

Lemma  6.6.  Under  either  fast  or  slow  fading,  for  every  a  >  6,  there  exists  a  constant 
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K\  such  that  for  all  V(n)  G  V(n)  and  all  Si,  S2  C  V(n),  S2  (1  Si  =  0, 


log(»l) 

C(Si,S2)<Ki\og\n )  J2\S*\. 

k= 0 


Proof.  Note  that 


Let 


S2  =  U  si 


k= 0 


He 


Si,S2  [hu,v\u£Si,v£S2 


be  the  matrix  of  channel  gains  between  the  nodes  in  Si  and  S2.  Linder  fast  fading 


C'(5'1,52)  =  max  E  logdet  (/  +  H^S2Q(H) HSus2)  ), 

Q\H)>  0:  \  7 

E( qu,u)<P  VueS  1 


and  under  slow  fading 


C(S,  Sc )  =  max  log  det  ( J  +  HluSaQHSuSa)  ■ 

qu,u<P  v«eSi 


Applying  the  generalized  Hadamard  inequality,  we  obtain  that  under  either  fast  or 
slow  fading 

C(Si,  S2)  <  C(S,  U gn)S2fc)  +  C(£,  Ufe>log(n)^2fc).  (6.23) 


For  the  first  term  in  (6.23),  using  Hadamard’s  inequality  once  more,  yields 


log(n) 


^d(SbW) 

fc=o  veS \ 
log  (n) 

<  E  EW’W)- 

fc=0  vdSti 


By  Lemma  4.6, 


C({v}c,  {u})  <  K  log(n), 
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and  thus 


log(n) 

C(SU ug<n)S2l)  <  K log(n)  V  |S‘|.  (6.24) 

k= 0 

For  the  second  term  in  (6.23),  we  have  the  following  upper  bound  from  (slightly 
adapting)  Theorem  2.1  in  [21]: 

C(s,,ut>l08ws‘)<  ]T  E(Er»h2)2- 

fc>log(n)  v€S$  ueSi 

Consider  v  G  S'*.  By  the  dehnition  of  S'*,  the  (open)  ball  of  radius  k  around  v  does 
not  contain  any  node  in  Si.  Moreover,  since  V  G  V,  there  are  at  most  log(n)  nodes 
inside  every  square  of  sidelength  one.  Thus 

OO 

rn  “/2  <  4tt(A;  +  2)2  log(n)A;-a/2  +  log(n)  10vr(A;  +  2 )k~a/2 

we Si  k=2k 

<  K  log (n)k2~a/2, 

for  some  constant  K  independent  of  Si  and  k.  Therefore, 

C(SU  Ufe>log(n)S2*)  <  \Sk2\K2  log2  (n)kA~a.  (6.25) 

fc>log(n) 

Consider  now  some  v  G  S'!  with  k  >  log(n),  and  let  u*  be  the  closest  point  in  Si 
to  v.  Since  v  G  S'*,  we  must  have 

T u*  ,v  [/c ,  k  1). 

Consider  the  (open)  ball  of  radius  ru*tV  around  v  and  the  ball  of  radius  log(n)  around 
u*.  Since  u*  is  the  closest  node  to  v  in  S\ ,  all  nodes  in  the  ball  around  v  are  in  S'2. 
Moreover,  the  intersection  of  the  two  balls  has  an  area  of  at  least  ^log2(n).  Since 
V  G  V,  this  implies  that  this  intersection  must  contain  at  least  one  point,  say  v,  and 


198 


by  construction 


log(n) 

V  e  U  sl 

k= 0 

This  shows  that  for  every  node  v  in  Sf  there  exists  a  node  v  in  such  that 

rv,a  e[k-  log  (n),  k+  1). 

Now,  since  V  E  V,  for  every  node  v ,  there  are  at  most 

2n(k  +  l)(log(n)  +  6)  log(ra)  <  K'klog2(n ) 

nodes  at  distance  [fc  —  log(n),  k  + 1)  for  some  constant  K' .  Hence  the  number  of  nodes 
in  S*  is  at  most 

log(n) 

\Sk2\<K'k\og2(n)  (6.26) 

k= 0 

Combining  (6.26)  with  (6.25)  yields 

C{S^k>log{n)Sk2)  <  \Sk\K2  log2  (n)k4~a 

fc>log(n) 

log(n) 

<K'K*  logV)(£>j|)  X  k^°  (6.27) 

k= 0  fc>log(n) 

l°g(n) 

=  /i"log4(n) 

k= 0 

for  some  constant  K" ,  and  where  we  have  used  that  ck  >  6.  Finally,  substituting 
(6.24)  and  (6.27)  into  (6.23)  shows  that 

log(re) 

C{S^  S2)  <  (. K  +  K")  log4(n)  X]  \S%\, 

k= 0 

which  proves  the  lemma  with 

Kx  =  K  +  K".  □ 
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The  next  lemma  shows  that  for  large  path-loss  exponents  (a  >  6)  every  cut  is 
approximately  achievable,  i.e. ,  for  every  cut  there  exists  an  achievable  unicast  traffic 
matrix  that  has  a  sum  rate  across  the  cut  that  is  not  much  smaller  than  the  cut 
capacity. 

Lemma  6.7.  Under  fast  fading,  for  every  a  >  6,  there  exists  b4(n)  <  n such  that 
for  V(n)  G  V(n)  and  S  C  V(n)  we  can  find  a  unicast  traffic  matrix  Auc  G  Auc(n) 
satisfying 

C(5,5c)<64(n)^^A“c„.  (6.28) 

u£S  w^S 

Moreover,  there  exists  a  collection  of  channel  gains  7i(n)  such  that 

^{{hu,v}u,vev(n)  e  H(n))  >  1  -  o(l) 

as  n  —a  oo,  and  such  that  for  {hUjV}UtV  G  hifn),  (6.28)  holds  for  slow  fading  as  well. 
Proof.  By  Lemma  6.6  for  V  G  V 

C(S,  Sc )  <  Kx  log4(n)|{n  G  Sc  :  rs,v  <  log(n)  +  1}|,  (6.29) 


where 

rs,v  —  min  r  uv . 

uGS 

Construct  a  unicast  traffic  matrix  Auc  G  M™xn  as 

uc  a  I  ^(n)  if  r'u’w  <  l°s{n)  +  1, 

/'u,w  \ 

I  0  else, 

for  some  function  p{n).  We  now  argue  that  for  p(n)  =  @(log^2(n))  there  exists 
b(n)  >  such  that  b(n) Auc  G  Auc.  This  follows  from  Theorem  4.1,  once  we  show 

that  for  every  i  G  {1, . . . ,  L(n)}  U  (log(n)}  and  i  G  {1, . . . ,  4£}  we  have 

<max{l,4^n}2'min{3’a}/2, 

u€Ve:i  w£V(ti 
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and 


E  E  CSmax{l,rV”M,!, 

u£Ve,i  wev^i 


Y  A«,«>  <  K  log 2(n)p(n)  Mw  G  V, 

U^W 

Y  AS  <  K  log 2{n)p{n)  Mu  G  V, 

W^U 

for  some  constant  K.  By  the  locality  of  the  traffic  matrix  Auc,  this  is  sufficient  to 
show  that  Theorem  4.1  applies  for  pin)  =  log_2(n).  Hence  b(n) Auc  G  Auc  for  fast 

fading,  and  the  same  conclusion  holds  for  slow  fading  for  some  Tt  with 

I ^({hu,v}u,v€V  G  7~t)  El-  o(l) 


as  n  — >  oo. 

Combined  with  (6.29),  this  implies  that 


C(S,  Sc )  < 


A' log6  (n) 
b(n ) 


E  E  A"« 

WES'  w^S 


proving  the  lemma. 


□ 


We  are  now  ready  for  the  proof  of  the  outer  bound  on  ACA(n). 


Proof  of  Theorem  6.5.  Consider  a  cut  S  C  V  in  the  wireless  network.  Assume  we 
allow  the  nodes  on  each  side  of  the  cut  to  cooperate  without  any  restriction  —  this 
can  clearly  only  increase  P\ca.  The  total  amount  of  traffic  that  needs  to  be  transmitted 
across  the  cut  is  then 

Y  S  ^v 

WcSvfS 

The  maximum  achievable  sum  rate  (with  the  aforementioned  node  cooperation)  is 
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given  by  C(S,SC )  the  MIMO  capacity  between  the  nodes  in  S  and  in  Sc.  Therefore 


Paca  <  nrin 


C(S,  Sc 


E, UcS  XU% 


(6.30) 


We  proceed  by  relating  the  cut  S  in  the  wireless  network  to  a  cut  S  in  G.  By 
Lemma  6.7  for  be  V,  there  exists  Auc  G  Auc  such  that  for  fast  fading 

C(S,S‘)<64(n)E)EA!5”  (6.31) 

uG:S  w^S 

and  (6.31)  holds  also  for  slow  fading  if  {hU;V}UtV  G  7i  (with  7i  defined  as  in  Lemma 
6.7).  By  Theorem  4.2,  for  a  >  5  and  be  V  there  exists  K  such  that  if  Auc  €  Auc 
then  iblog-6(n)Auc  G  A^c. 

Now,  consider  any  S  C  bg  such  that  S  fl  V  =  S.  Note  that  S'  is  a  cut  in  G 
separating  S  from  V  \  S.  Since  K  log_6(n)Auc  G  A^c,  we  thus  have 

K  l°z~6(n)xu%  <  °u .«> 

u£S  w(£S  ( u,v)£Eg ■ 

u£S,v(£S 

and  by  minimizing  over  the  choice  of  S, 


EEA'loe"f(")^A 

uSS  ui^5 


_  nrin 

S:Snv=s 


E 

(u,v)eEG: 

U£S,V^S 


Cu,v 


(6.32) 


Combining  (6.31)  and  (6.32)  shows  that 


C(S,  Sc )  < 


64  (n) 
Jl 


log6 


in 


__  nun 
S:SnV=S 


E  • 

(u,v)eEg- 

U€:S,V(f:S 


Together  with  (6.30),  and  using  Lemmas  4.5  and  6.7,  this  yields  that  with  prob¬ 
ability 


^{{hu,v}u,v  £  77,  V  G  V)  >  1  —  o(l) 
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as  n  — >  oc,  we  have  for  any  caching  traffic  matrix  A 


CA 


z  .  C(S,  Sc ) 

pxCA  ~  'lev  V - V - ACA- 

l^UcS  2^W(£S  AU,w 


<  b2(n)  min  _  min 


A  7  ( u.v )  G  Kn :  Cu,?; 
u£S,v$:S 


~  ~  v-^,  \  r  a 

's'cV'  SeVG:Snv=s  Z^c/cSnv  l^wev\s  Au,w 
^1(u,v)£Eq:  Cu,v 


=  bo  in)  min 


u£S,v^S 


ScvG  J2ucSrv  ^2w£V\s  ^u% 
=  h{n)pxcA, 


with 

b2(n)  4  M^iog6(n)  <  no(1). 
K 


□ 


6.5  Discussion 

Here  we  discuss  extensions  and  implications  of  the  results  presented  in  this  chapter. 
In  Section  6.5.1  we  consider  dense  networks,  and  in  Section  6.5.2  we  discuss  design 
guidelines. 

6.5.1  Dense  Networks 

The  results  presented  so  far  in  this  chapter  assumed  extended  node  placement.  Here 
we  discuss  how  these  results  can  be  modified  for  the  dense  case,  where  n  nodes  are 
located  on  a  square  of  area  one. 

As  was  the  case  for  unicast  and  multicast  traffic  (see  chapters  4  and  5),  achievabil- 
ity  for  dense  networks  (regardless  of  the  value  of  path-loss  exponent)  can  be  derived 
from  the  achievability  result  for  extended  networks  by  taking  a  limit  as  a  — »  2.  This 
shows  that  for  dense  networks,  for  any  a  >  2,  there  exists  b\(n)  >  n~°^  such  that 
with  probability  1  —  o(l) 

&i(n)ACA(n)  C  A CA(n) 
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with 


^  ^  - 

UcSrV(n)  wGV(n)\S 


J2  cu,v  VS  c  Vg 

( u,v)£Eq : 
u£S,v<£S 


and  for  an  edge  e  at  level  t  in  G, 


{4  en  if  1  <  £  <  L(n), 
1  if  £  =  L(n)  +  1. 


For  the  converse  result,  we  need  to  replace  Lemma  6.6  by  the  inequality 

C(S,SC )  <  K log(n)  minds'!,  \SC\} 

for  some  constant  K ,  and  we  need  to  modify  Lemma  6.7  accordingly.  From  this,  we 
obtain  that  for  dense  networks,  for  any  a  >  2,  there  exists  62  W  <  n° ^  such  that 
with  probability  1  —  o(l) 

ACA(n)  C  b2(n)Ac\n). 

This  is  a  stronger  result  than  the  corresponding  upper  bound  for  extended  networks, 
which  was  only  shown  to  hold  for  a  >  6. 

In  other  words,  for  dense  networks,  for  any  value  of  path-loss  exponent  a  >  2,  we 
have  ACA(n)  =  n±°^1^ACA(n).  Furthermore,  in  the  dense  setting,  the  expression  for 
ACA(n)  can  be  simplified  to 

Ac»  =  |aca  e  R2;*”  :  5]  5]  A“  <  mm{|S|,n  -  |S|}  VS  c  V(n) 

Note  that  here  we  only  consider  subsets  of  V(n)  (as  opposed  to  Va{n)).  Hence,  for 
the  dense  case  ACA(n)  provides  a  complete  scaling  characterization  of  the  2”  x  n- 
dimensional  caching  capacity  region  ACA(n)  in  terms  of  @(2n)  cuts  (as  opposed  to 
@(4n)  cuts  necessary  for  the  description  of  A CA(n)  for  the  extended  case). 
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6.5.2  Design  Guidelines 

The  results  presented  in  this  chapter  suggest  the  following  design  guidelines  for  uti¬ 
lizing  caches  in  large  wireless  networks.  First,  as  we  have  seen  in  Example  6.2,  the 
strategy  of  selecting  only  the  closest  cache  can  be  quite  bad  in  general.  Instead,  the 
cache  selection  has  to  be  done  in  a  load-balanced  fashion.  In  our  proposed  three-layer 
architecture,  cache  selection  is  performed  by  solving  a  linear  program.  Moreover, 
we  argued  that  the  solution  to  this  linear  program  can  be  found  efficiently  by  a 
distributed  algorithm.  On  the  other  hand,  when  the  cache  location  and  traffic  are 
already  balanced  enough  (as  is  the  case  in  Example  6.3),  this  load  balancing  can  be 
omitted,  and  closest  cache  selection  can  indeed  be  close  to  optimal. 

Second,  the  results  suggest  that,  at  least  in  the  large  path-loss  regime,  the  problem 
of  optimal  cache  selection  can  be  solved  at  the  routing  layer.  In  other  words,  the 
physical  and  cooperation  layer  are  not  affected  by  the  introduction  of  caches. 

6.6  Chapter  Summary 

We  analyzed  the  influence  of  caching  on  the  performance  of  wireless  networks.  Our 
approach  is  information-theoretic,  yielding  a  scaling  characterization  of  the  complete 
caching  capacity  region  in  the  high  path-loss  regime  a  >  6.  Even  though  this  region 
is  2n  x  n  dimensional  (i.e. ,  exponential  in  the  number  of  nodes  n  in  the  wireless 
network),  we  present  an  algorithm  that  checks  approximate  feasibility  of  a  particular 
caching  traffic  matrix  efficiently  (in  polynomial  time  in  the  description  length  of  the 
caching  traffic  matrix).  Achievability  is  proved  using  a  three-layer  communication 
architecture  achieving  the  entire  caching  capacity  region  in  the  scaling  sense  for  a  >  6. 
The  three  layers  deal  with  optimal  selection  of  caches,  choice  of  amount  of  necessary 
cooperation,  noise  and  interference,  respectively.  The  matching  (in  the  scaling  sense) 
converse  proves  that  addressing  these  questions  separately  is  without  loss  of  order- 
optimality  in  the  regime  of  high  path- loss  exponent. 
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Chapter  7 


Conclusions 


This  chapter  contains  concluding  remarks.  In  Section  7.1  we  summarize  the  results 
presented  in  this  thesis.  Section  7.2  contains  pointers  for  future  work. 

7.1  Thesis  Summary 

In  this  thesis,  we  have  considered  the  impact  of  heterogeneities  on  achievable  rates  in 
large  wireless  networks.  Three  types  of  such  heterogeneities  were  discusses  in  detail: 
location  heterogeneity,  traffic  heterogeneity,  and  service  heterogeneity. 

We  analyzed  location  heterogeneity  be  allowing  the  nodes  in  the  wireless  network 
to  be  placed  arbitrarily  (with  a  minimum-separation  constraint).  This  contrasts  with 
the  standard  homogeneity  assumption,  i.e.,  that  nodes  are  placed  independently  and 
uniformly  at  random.  For  the  traffic  model,  we  assumed  random  source-destination 
pairing  with  uniform  rate.  We  have  seen  that  the  impact  of  arbitrary  node  place¬ 
ment  depends  strongly  on  the  path-loss  exponent  a.  For  small  path-loss  exponents 
a  €  (2,  3],  we  showed  that  the  node  placement  has  no  impact  on  achievable  rates  and 
that  global  cooperation  is  necessary  irrespective  of  the  node  placement.  We  proposed 
a  novel  cooperative  communication  scheme  (called  hierarchical  relaying),  and  showed 
that  it  is  order  optimal  for  all  node  placements.  For  large  path-loss  exponents  a  >  3, 
we  have  seen  that  the  node  placement  critically  impacts  achievable  rates  as  well  as 
optimal  communication  schemes.  For  very  regular  node  placements,  multi-hop  corn- 
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munication  is  order  optimal,  for  very  irregular  node  placements,  hierarchical  relaying 
is  optimal.  We  proposed  a  communication  scheme  “interpolating”  between  these  two 
schemes  depending  on  the  regularity  of  the  node  placement,  and  showed  that  this 
scheme  is  order  optimal  under  adversarial  node  placement  with  regularity  constraint. 

For  traffic  heterogeneity,  we  analyzed  the  n2-dimensional  unicast  capacity  region 
Auc(  n) .  In  other  words,  we  allowed  general  unicast  traffic.  This  contrasts  with  the 
standard  homogeneity  assumption,  i.e.,  that  each  node  is  source  exactly  once  for  a 
destination  chosen  independently  and  uniformly  at  random  from  among  all  the  other 
nodes,  and  that  all  these  n  source-destination  pairs  communicate  at  equal  rate.  For 
the  node  placement,  we  assumed  that  nodes  are  place  independently  and  uniformly  at 
random.  We  presented  inner  an  outer  bounds  on  the  unicast  capacity  region  Auc(n). 
These  bounds  coincide  up  to  a  factor  n±0 ^  along  at  least  n2  —n  out  of  n2  dimensions 
(corresponding  to  balanced  traffic)  for  a  G  (2,5],  and  along  all  n2  dimensions  for 
a  >  5.  Hence,  for  a  >  5,  this  provides  a  scaling  characterization  for  the  entire  n2- 
dimensional  unicast  capacity  region  Auc(n),  and  the  same  statement  is  true  along  at 
least  n2  —n  dimensions  for  a  G  (2,  5].  For  the  inner  bound,  we  provided  a  three-layer 
communication  architecture.  The  three  layers  are  the  routing  layer,  the  cooperation 
layer,  and  the  physical  layer.  In  the  routing  layer,  we  perform  load  balancing  - 
dealing  with  the  traffic  heterogeneity.  In  the  cooperation  layer,  we  distribute  and 
concentrate  traffic  over  the  wireless  network  —  choosing  the  appropriate  amount  of 
cooperation.  In  the  physical  layer,  we  implement  this  distribution  an  concentration  of 
traffic  —  handling  interference  and  noise.  The  approximate  optimality  of  this  scheme 
(in  the  sense  mentioned  above)  shows  that  these  problems  can  be  solved  separately 
without  loss  of  order  optimality. 

For  service  heterogeneity,  we  analyzed  the  n  x  2"-dimensional  multicast  capacity 
region  AMC(n)  and  the  2n  x  n- dimensional  caching  capacity  region  ACA(n).  This 
contrasts  with  the  standard  service  homogeneity  assumption,  i.e.,  that  all  demands  are 
unicast.  For  the  node  placement,  we  again  assumed  that  nodes  a  placed  independently 
and  uniformly  at  random.  We  presented  inner  and  outer  bounds  on  AMC(n)  and 
ACA(n).  For  the  multicast  case,  these  bounds  coincide  up  to  a  factor  along 
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at  least  n2n  —  n  out  of  n2n  dimensions  for  a  G  (2,5],  and  along  all  n2n  dimensions 
for  a  >  5.  Hence,  for  a  >  5,  this  provides  a  scaling  characterization  of  the  entire 
n  x  2n-dimensional  multicast  capacity  region  AMC(n),  and  the  same  statement  is  true 
along  at  least  n2n  —  n  dimensions  for  a  G  (2,5].  For  the  caching  case,  these  bounds 
coincide  up  to  a  factor  n±0 ^  for  a  >  6,  yielding  a  scaling  characterization  of  the  entire 
2n  x  n-dimensional  caching  capacity  region  ACA(n)  in  the  high  path-loss  regime.  For 
the  inner  bounds,  we  showed  how  the  three-layer  architecture  proposed  for  general 
unicast  traffic  can  be  modified  to  accommodate  multicast  and  caching  traffic.  In  both 
cases,  only  the  routing  layer  needed  to  be  changed  —  the  cooperation  and  physical 
layer  operate  as  in  the  unicast  case. 

7.2  Future  Work 

There  are  several  directions  for  future  work,  of  theoretical  as  well  as  practical  interest. 

The  following  questions  are  of  theoretical  interest,  aiming  at  broadening  our  un¬ 
derstanding  of  large  heterogeneous  wireless  networks.  First,  note  that  the  converse 
for  location  heterogeneity  in  the  a  >  3  regime  is  only  under  adversarial  node  place¬ 
ment  with  regularity  constraint.  One  direction  would  be  to  find  a  converse  for  any 
node  placement.  Second,  for  both  unicast  and  multicast  traffic,  the  scaling  charac¬ 
terization  holds  for  all  but  n  dimensions  of  the  respective  capacity  regions  in  the 
ol  G  (2,  5]  regime.  A  second  direction  for  future  work  is  to  complete  the  scaling  char¬ 
acterization  for  the  remaining  n  dimensions  in  the  low  path-loss  regime.  Similarly,  for 
caching  traffic,  the  results  are  only  partial  (namely  only  an  inner  bound  is  available) 
for  a  <  6.  Completing  the  picture  for  a  <  6  would  again  be  of  interest.  Third, 
the  results  for  traffic  and  service  heterogeneity  are  derived  assuming  random  node 
placement.  Obtaining  scaling  characterizations  of  the  unicast  or  multicast  capacity 
regions  under  arbitrary  node  placement  would  be  of  interest.  Ultimately,  the  goal  is 
to  develop  a  complete  scaling  description  of  achievable  rates  in  large  heterogeneous 
wireless  networks. 

The  following  questions  and  directions  for  future  work  are  of  practical  interest. 
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First,  most  of  the  architectures  presented  in  this  thesis  are  centralized.  To  be  im- 
plcmentable  in  practice,  decentralized  architectures  need  to  be  developed.  Second, 
issues  of  delay  have  been  completely  ignored  throughout  this  thesis.  Especially  in  the 
broadcast  phase  of  the  hierarchical  relaying  scheme,  delays  can  be  quite  large.  This 
issue  needs  to  be  addressed  before  being  implementable.  Third,  how  to  implement  the 
communication  architectures  proposed  in  this  thesis  within  existing  protocol  stacks 
needs  to  be  addressed.  For  example,  the  distribution  and  concentration  of  traffic  in 
the  cooperation  layer  of  the  three-layer  architecture  for  traffic  heterogeneity  results  in 
growing  header  overhead.  The  design  guidelines  presented  towards  the  end  of  the  var¬ 
ious  chapters  may  be  helpful  in  deciding  which  parts  of  the  communication  schemes 
presented  in  this  thesis  are  crucial  for  optimal  operation  of  large  wireless  networks, 
and  which  parts  can  be  modified  for  simpler  implementation. 
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